GFF tools
News
May 11, 2011: Abstract about the GFF tools accepted for a poster presentation at BOSC 2011.
Introduction
A collection of tools gives a better understanding about data in a Generic feature format Version 3. The programs are available at our Galaxy server.
The stand-alone versions of each tool along with Galaxy wrappers are available here.
Requirements
Python, Perl
Dependencies
Contents
The tools in the first section validate the GFF file and returns a detailed report on the file contents. They are:
- Examine
- FeatureScan
- Comprehensive GFF3 Validator
- MergeLoci
The second section mainly deals with converting between GTF, BED, GenBank and GFF3 formats, Which includes:
- GTF_to_GFF3
- GFF3_to_GTF
- BED_to_GFF3
- GFF3_to_BED
- GenBank_to_GFF3
Detailed description and usage
a(1). Examine
Provides the features(gene, mRNA, UTR's, exon, CDS etc) relationship based on their identifier mapping in a genome annotation file.
python gff_idmapper.py example.gff3
a(2). FeatureScan
Provides an overview of available contig identifiers, sources and feature types and their corresponding counts from a given genome annotation file.
python gff_featurescan.py example.gff3
a(3). Comprehensive GFF3 Validator
Provides a comprehensive scanning over genome annotation file, for example: Consistency check for contig ID present in provided genome sequence file (FASTA) and GFF3.
python comprehensive_check.py example.gff3
a(4). MergeLoci
MergeLoci will be able to merge several transcripts from single loci to a main feature instead of several features.
python gff_mergeloci.py example.gff3
b(1). GTF_to_GFF3
Converts data from Gene Transfer Format to a valid Generic Feature Format Version 3.
python gtf_to_gff3_converter.py -q example.gtf -o example.gff3
b(2). GFF3_to_GTF
Converts data from Generic Feature Format Version 3 to Gene Transfer Format
perl gff3_to_gtf_converter.pl example.gff3 example.gtf
b(3). BED_to_GFF3
Converts data from a 12 column UCSC wiggle Browser Extensible Data (BED) format to Generic Feature Format Version 3.
python bed_to_gff3_converter.py -q example.bed -o example.gff3
b(4). GFF3_to_BED
Converts data from Generic Feature Format Version 3 (GFF3) to UCSC wiggle 12 column Browser Extensible Data.
python gff3_to_bed_converter.py -q example.gff3 -o example.bed
b(5). GenBank_to_GFF3
Converts annotations in GenBank format to Generic Feature Format Version 3 (GFF3).
perl genbank2gff3.pl -f GenBank example.gbk -out stdout > example.gff3
More information about each tool can be found at Galaxy tool page. Questions/Comments/Suggestions, I can be reached at vipin.ts@tue.mpg.de