Frequently Asked Questions
Analysis
Q: How can I do a comparative genomic analysis between an genome currently in IMG and an genome that JGI just sent me but is not yet ready for public release?
A: If an genome is not yet in IMG you cannot compare it with other IMG genomes. However, IMG is updated quarterly, so once a new genome is available for public release, it will also appear in the next release of IMG.
Q: How can I analyze my genome in IMG while keeping it private until it is published?
A: If your genome sequence must be kept confidential until a publication is prepared, we can load the sequence into our password protected internal IMG version and provide you password access to this version for analysis and annotation purposes.
Content
Q: I can download the raw nucleotide sequence of the contigs of my microbe of interest. Can I also download your annotation of the contigs as they appear on your website? (In GenBank format for example?)
A: You can download DNA sequence for all scaffolds in FASTA format, amino acid sequences for all proteins coded by genes in FASTA format, and gene information with some annotation in tab delimited format readable by Excel. You can also get to the ORNL annotations (html and tab files) link, from IMG's Genome detail page, by following the link to
JGI's Genome Portal Page
which has a link to ORNL's annotatons. A GenBank or EMBL format may be
generated with MyIMG or IMG Terms optionally replacing the original
gene product name.
Q: How can I add my genome into IMG?
A: You can contact us via the
Questions/Comments
form requesting the addition of your genome to IMG.
Q: Why are some published complete genomes missing from IMG?
A: IMG is updated on a quarterly basis. Therefore we collect all the publicly available complete microbial genomes, one month before the actual release and include them in IMG. Our data source is EBI-GR for most bacterial and archeal complete genomes.
Q:
How accurate are the predictions of genes and RNAs for the draft genomes?
A:Any analysis based on draft sequence data is preliminary and
may contain errors.
Similar to the problems in the identification of the genes,
the rRNA gene calls for draft
assemblies may not be an accurate reflection of the
number of rRNA operons present in
the genome.
This is mainly due to assembly errors caused by repeats such as rRNA operons,
often resulting in partial rRNA genes at the end of contigs.
System
Q: Is the IMG available for local installations so that we could incorporate our own microbial genome data into IMG on our own servers?
A: IMG is not available for local installations. Resources required for making the IMG collection of tools and databases ready for installation at other sites are not available at this time.
Q: How is IMG different from other systems like PUMA, or SEED?
A: While IMG shares goals with systems such as PUMA and SEED (see
IMG Lineage),
IMG's main goal is to support the analysis of the genomes sequenced at JGI.
User Interface
Q: Why was the structure of the Web user interface changed for the new version of IMG?
A: The new User Interface for IMG has been structured so that it reflects the key three dimensions of the microbial genome data space: genomes, functional annotations, and genes. Thus, selections for each such dimensions have been grouped under
"Find Genes", "Find Genomes", and "Find Functions",
respectively. The main comparative analysis tools have been grouped under
"Compare Genomes". Preferences have been included into MyIMG. The
User Interface Map
provides a graphical view of the main IMG pages and functions in the new user interface.
Q: Where is the Phylogenetic Profiler?
A: The Phylogenetic Profiler is a special type of gene selection, and therefore has been included under "Find Genes".
Q: How many types of profiles are supported by IMG and how can I find them?
A:
"Profile" is a generic term in IMG to describe
analysis of genes or functions in one dimension, and
genomes in another. There are two
major classes. Genes (on a locus)
are organism dependent entities whereas functions are
organism independent. Genes are related to other genes
through pairwise relationships, such as those
that via bidirectional best hits or simple
BLAST sequence similarity.
Functions may be coded by membership in
COG, Pfam, Enzyme, TIGRfam, IMG term or pathways
and establishes relationships through ontological
identity. A number of operations for both
of these types of profiles
are are described in the IMG documentation.
