508 Conserved Domain Database models. CDD 1 {"type": "domain"} 252 Covariance models from Rfam (release 12.2), aligned to the genome with 'cmscan' from the Infernal suite of programs. Models are restricted to those observed in species that share a last common ancestor (LCA). Rfam Models (LCA) 1 {"type": "rna"} 24 Density of coding genes, calculated by dividing the chromosome into 150 "bins" and counting the genes in each. (For very short chromosomes, e.g. MT, some genes contribute to multiple bins.) Coding genes (density) 1 \N 254 Dust is a program that identifies low-complexity sequences (regions of the genome with a biased distribution of nucleotides, such as a repeat). The Dust module is widely used with BLAST to prevent 'sticky' regions from determining false hits. Low complexity (Dust) 1 \N 1 Protein coding genes annotated in ENA Genes 1 {"label_key": "[biotype]", "caption": "Genes", "colour_key": "[biotype]", "default": {"cytoview": "gene_label", "MultiBottom": "collapsed_label", "alignsliceviewbottom": "as_collapsed_label", "MultiTop": "gene_label", "contigviewbottom": "transcript_label", "contigviewtop": "gene_label"}, "name": "Genes", "key": "ensembl"} 12 3'UTR feature annotated in ENA 3'UTR (ENA) 1 {"caption": "Genomic features", "key": "ena_features", "label_key": "[text_label] [display_label]", "multi_name": "Genomic features", "name": "Genomic features"} 16 5'UTR feature annotated in ENA 5'UTR (ENA) 1 {"caption": "Genomic features", "key": "ena_features", "label_key": "[text_label] [display_label]", "multi_name": "Genomic features", "name": "Genomic features"} 13 Assembly gap feature annotated in ENA Assembly gap (ENA) 1 \N 26 gene feature annotated in ENA gene (ENA) 1 {"caption": "Genomic features", "key": "ena_features", "label_key": "[text_label] [display_label]", "multi_name": "Genomic features", "name": "Genomic features"} 21 intron feature annotated in ENA intron (ENA) 1 {"caption": "Genomic features", "key": "ena_features", "label_key": "[text_label] [display_label]", "multi_name": "Genomic features", "name": "Genomic features"} 27 misc_feature feature annotated in ENA misc_feature (ENA) 1 {"caption": "Genomic features", "key": "ena_features", "label_key": "[text_label] [display_label]", "multi_name": "Genomic features", "name": "Genomic features"} 20 misc_RNA feature annotated in ENA misc_RNA (ENA) 1 {"caption": "Genomic features", "key": "ena_features", "label_key": "[text_label] [display_label]", "multi_name": "Genomic features", "name": "Genomic features"} 25 mRNA feature annotated in ENA mRNA (ENA) 1 {"caption": "Genomic features", "key": "ena_features", "label_key": "[text_label] [display_label]", "multi_name": "Genomic features", "name": "Genomic features"} 19 ncRNA genes annotated in ENA ncRNA genes (ENA) 1 {"label_key": "[biotype]", "caption": "Genes", "colour_key": "[biotype]", "default": {"cytoview": "gene_label", "MultiBottom": "collapsed_label", "alignsliceviewbottom": "as_collapsed_label", "MultiTop": "gene_label", "contigviewbottom": "transcript_label", "contigviewtop": "gene_label"}, "name": "Genes", "key": "ensembl"} 529 CATH/Gene3D families. Gene3D 1 {"type": "domain"} 17 Cross-references attached by GenomeLoader to provide names GenomeLoader name cross-references 0 \N 2 Cross-references attached by GenomeLoader GenomeLoader cross-references 0 \N 535 \N GOA annotation 0 \N 288 GO term derived transitively from a UniProt record UniProt-derived GO term 0 \N 36 The Gene Ontology XRef projection pipeline GO projected xrefs 0 \N 295 Gene encoding an enzyme annotated at the Plant Reactome. Plant Reactome 1 \N 521 HAMAP families. HAMAP 1 {"type": "domain"} 509 PANTHER families. PANTHER 1 {"type": "domain"} 531 InterPro2GO mapping, defined by InterPro. InterPro2GO mapping 0 \N 115 Density of long non-coding RNA genes, calculated by dividing the chromosome into 150 "bins" and counting the genes in each. (For very short chromosomes, e.g. MT, some genes contribute to multiple bins.) Long non-coding genes (density) 1 \N 260 RNA genes imported from miRBase . RNA genes 1 {"label_key": "[biotype]", "caption": "Genes", "colour_key": "[biotype]", "default": {"cytoview": "gene_label", "MultiBottom": "collapsed_label", "alignsliceviewbottom": "as_collapsed_label", "MultiTop": "gene_label", "contigviewbottom": "transcript_label", "contigviewtop": "gene_label"}, "name": "Genes", "key": "ensembl"} 520 Intrinsically disordered regions predicted by MobiDB lite. MobiDB lite 1 {"type": "feature"} 524 Coiled-coil regions predicted by Ncoils. Coiled-coils (Ncoils) 1 {"type": "feature"} 113 Percentage of repetitive elements for top level sequences (such as chromosomes, scaffolds, etc.) Repeats (percent) 1 \N 22 Percentage of G/C bases in the sequence. GC content 1 \N 510 Protein domains and motifs from the Pfam database. Pfam 1 {"type": "domain"} 532 Protein domains and motifs from the PROSITE profiles database. PROSITE profiles 1 {"type": "domain"} 513 Protein domains and motifs from the PIR (Protein Information Resource) Superfamily database. PIRSF 1 {"type": "domain"} 514 Protein fingerprints (groups of conserved motifs) from the PRINTS database. Prints 1 {"type": "domain"} 114 Density of pseudogenes, calculated by dividing the chromosome into 150 "bins" and counting the genes in each. (For very short chromosomes, e.g. MT, some genes contribute to multiple bins.) Pseudogenes (density) 1 \N 256 Repeats detected using the MIPS Repeat Database (REdat) using RepeatMasker. Repeats: REdat 1 \N 257 Repeats identified by RepeatMasker, using the Repbase library of repeat profiles. Repeats: Repbase 1 \N 261 RNA genes produced by filtering alignments of Rfam (release 12.2) covariance models. RNA genes 1 {"label_key": "[biotype]", "caption": "Genes", "colour_key": "[biotype]", "default": {"cytoview": "gene_label", "MultiBottom": "collapsed_label", "alignsliceviewbottom": "as_collapsed_label", "MultiTop": "gene_label", "contigviewbottom": "transcript_label", "contigviewtop": "gene_label"}, "name": "Genes", "key": "ensembl"} 28 Non-coding RNA gene models based on alignment by of RFAM families to genomic sequences (alignments provided by RFAM) RFAM 1 {"caption": "RFAM Genes", "colour_key": "[biotype]", "default": {"MultiBottom": "collapsed_label", "MultiTop": "gene_label", "alignsliceviewbottom": "as_collapsed_label", "contigviewbottom": "transcript_label", "contigviewtop": "gene_label", "cytoview": "gene_label"}, "key": "rfam_genes", "label_key": "[biotype]", "multi_caption": "RFAM Genes", "name": "Non-coding RNA genes based on RFAM alignments"} 530 Protein domains and motifs from the PROSITE patterns database. PROSITE patterns 1 {"type": "domain"} 523 Low complexity peptide sequences identified by Seg. Low complexity (Seg) 1 {"type": "feature"} 515 Structure-Function Linkage Database families. SFLD 1 {"type": "domain"} 23 Density of short non-coding RNA genes, calculated by dividing the chromosome into 150 "bins" and counting the genes in each. (For very short chromosomes, e.g. MT, some genes contribute to multiple bins.) Short non-coding genes (density) 1 \N 525 Signal peptide cleavage sites predicted by SignalP. Cleavage site (Signalp) 1 {"type": "feature"} 517 Protein domains and motifs from the SMART database. SMART 1 {"type": "domain"} 518 Protein domains and motifs from the SUPERFAMILY database. Superfamily 1 {"type": "domain"} 519 Protein domains and motifs from the TIGRFAM database. TIGRFAM 1 {"type": "domain"} 522 Transmembrane helices predicted by TMHMM. Transmembrane helices 1 {"type": "feature"} 255 Tandem Repeats Finder locates adjacent copies of a pattern of nucleotides. Tandem repeats (TRF) 1 \N 253 tRNA models predicted with tRNAscan-SE (release 1.3.1). tRNA Models 1 {"type": "rna"} 262 RNA genes produced by filtering predictions from tRNAscan-SE v1.23. RNA genes 1 {"label_key": "[biotype]", "caption": "Genes", "colour_key": "[biotype]", "default": {"cytoview": "gene_label", "MultiBottom": "collapsed_label", "alignsliceviewbottom": "as_collapsed_label", "MultiTop": "gene_label", "contigviewbottom": "transcript_label", "contigviewtop": "gene_label"}, "name": "Genes", "key": "ensembl"} 286 UniParc mapping based on sequence checksums UniParc cross-reference 0 \N 287 UniProt cross-reference derived transitively from a UniParc identifier UniParc-derived cross-reference 0 \N 289 Cross-reference derived transitively from a UniProt record UniProt-derived cross-reference 0 \N 292 Cross references to RefSeq nucleotide sequences, determined by alignment against the transcriptome with blastx. RefSeq transcripts 0 \N 291 Cross references to RefSeq peptide sequences, determined by alignment against the proteome with blastp. RefSeq peptides 0 \N 290 Cross references to UniProt Swiss-Prot (reviewed) proteins, determined by alignment against the proteome with blastp. UniProt reviewed proteins 0 \N 293 Cross references to UniProt TrEMBL (unreviewed) proteins, determined by alignment against the proteome with blastp. UniProt unreviewed proteins 0 \N