1. File format:
GFF files or GTF files, supporting gtf/gtf.gz, gff/gff.gz, gff3/gff3.gz as file suffix names.
2. GTF file format:
Comment lines begin with #
The main body has 9 columns, separated by 'tab': seqname source feature start end score strand frame attributes
type: types of annotation information must contain gene,transcript and exon
start/end: need to be less than 231
strand: forward and reverse of strands, represented as + and -, respectively
attributes as the 9th column, whose format is tag "value" , with different attributes separated by space; of which the following four are required.
gene_name value
gene_id value: represents the unique ID of a transcript for the given gene loci of the genome. 'gene_id' and 'value' are separated by space. If the value is empty, it means that there is no corresponding gene.
transcript_name value
transcript_id value : a unique ID to identify a transcript. Empty value means no transcript.
At present, the maximum valid gene number must be less than 220, that is 1048576
Do not disrupt order. The same gene's transcript/exons need to be arranged in order
3. GFF file format:
Comment lines begin with #
The main body has 9 columns, separated by 'tab': seqid source type start end score strand phase attributes
type: types of annotation information must contain gene,mRNA and exon
start/end: max of them need to be less than 231
strand: "+" stands for forward strands, "-" stands for reverse strands, "." indicates there is no need to specify positive or negative strands, "?" means unknown
attributes as the 9th column, whose format is tag=value, with different attributes separated by semicolon
ID Name Parent must provide (Parent is not required for each gene)
For naming rules of the 3rd column, please carefully check on ⇒ "dendrachy" (tree-shaped hierarchy) (do not list 'child' rows without 'parent' rows!) An example is shown as follows:

At present, the maximum valid gene number must be less than 220, that is 1048576
Although ordering is not required, the rules that 'gene' must appear ahead of corresponding mRNA, and mRNA must appear ahead of corresponding exon still need to be met.
4. Others to note:
gene/gene_name should not contain any special symbols (space, all types of brackets, quotation marks, <>, %, etc.) other than common symbols such as "_" and "."
gene/gene_name shorter than 64 characters
Although the mainly used GFF files are version 3 (GFF3), please name them as .gff ; likewise, please name GTF files as .gtf