IGV

This option enables additional files to be associated with the FASTA reference sequence file, as described below. These files are archived in a zip with with a .genome extension. This option also allows the reference sequence to be defined as a directory of FASTA files, rather than a single FASTA.

Prerequisites:

  • Either (1) a FASTA file that contains the sequence data for each chromosome, or (2) a directory.  Directories of zip archives and gzipped FASTAs are no longer supported.
  • A cytoband file, which IGV uses to display the chromosome ideogram.  (Optional)
  • An annotation file, which IGV uses to display the reference gene track. The file can be in BED format, GFF format, or any variation of the genePred table format.  (Optional)
  • An alias file defining alternative names for chromosomes.  (Optional)

Note: If you are choosing files from the NCBI directory, you will generally want to use the .fna or .ffn file (nucleic acid sequences), as opposed to the .faa (amino acids). Choose the .gff file for the annotation file.

Step-by-step:

  1. Click Genomes>Create .genome File. IGV displays the a window where you enter the information.
  2. Enter an ID and a descriptive name for the genome.
  3. Enter the path on your file system or a web URL to the FASTA file for the genome.  If the FASTA file has not already been indexed, an index will be created during the import process. This will generate a file with a “.fai” extension which must be in the same directory as the FASTA file; thus it is necessary that the directory containing the file be writable.
  4. Optionally, specify the cytoband file and the annotation (gene) file.
  5. If the sequence (chromosome) names differ between your FASTA and annotation files, you might need to create an alias file to provide a mapping between the different names. Certain well-known aliases are built into IGV and do not require an alias file. These include mappings that involve adding or removing the prefix “chr” to the name, for example  1 > chr1 and chr1 > 1.  Also, NCBI identifiers of the form  **gi 125745044 ref NC_002229.3  in a FASTA file will be mapped to names of the form NC_002229.3** in the corresponding GFF file.
  6. Click Save. IGV displays the Genome Archive window.
  7. Select the directory in which to save the genome archive (.genome) file and click *Save. IGV saves the genome and loads it into IGV.
CHENTONG
版权声明:本文为博主原创文章,转载请注明出处。
alipay.png WeChatPay.png

CHENTONG

CHENTONG
积微,月不胜日,时不胜月,岁不胜时。凡人好敖慢小事,大事至,然后兴之务之。如是,则常不胜夫敦比于小事者矣!何也?小事之至也数,其悬日也博,其为积也大。大事之至也希,其悬日也浅,其为积也小。故善日者王,善时者霸,补漏者危,大荒者亡!故,王者敬日,霸者敬时,仅存之国危而后戚之。亡国至亡而后知亡,至死而后知死,亡国之祸败,不可胜悔也。霸者之善著也,可以时托也。王者之功名,不可胜日志也。财物货宝以大为重,政教功名者反是,能积微者速成。诗曰:德如毛,民鲜能克举之。此之谓也。

生信宝典文章集锦

### 程序学习心得* [生物信息之程序学习](http://mp.weixin.qq.com/s?__biz=MzI5MTcwNjA4NQ==&mid=2247483927&idx=1&sn=23adf2b9d13400f2081f790e674e...… Continue reading

R统计绘图 - 柱状图

Published on August 12, 2017

R 学习 - 维恩图

Published on August 01, 2017