30 Alignments of EST clusters (Unigenes, Gene Indices) from Arabidopsis EST Cluster (Arabidopsis) 1 {"type": "cdna"} 738 Conserved Domain Database models. CDD 1 {"type": "domain"} 481 Covariance models from Rfam (release 12.2), aligned to the genome with 'cmscan' from the Infernal suite of programs. Models are restricted to those observed in species that share a last common ancestor (LCA). Rfam Models (LCA) 1 {"type": "rna"} 485 Dust is a program that identifies low-complexity sequences (regions of the genome with a biased distribution of nucleotides, such as a repeat). The Dust module is widely used with BLAST to prevent 'sticky' regions from determining false hits. Low complexity (Dust) 1 \N 3 Protein coding genes annotated in ENA Genes 1 {"label_key": "[biotype]", "caption": "Genes", "colour_key": "[biotype]", "default": {"cytoview": "gene_label", "MultiBottom": "collapsed_label", "alignsliceviewbottom": "as_collapsed_label", "MultiTop": "gene_label", "contigviewbottom": "transcript_label", "contigviewtop": "gene_label"}, "name": "Genes", "key": "ensembl"} 8 Assembly gap feature annotated in ENA Assembly gap (ENA) 1 \N 22 exon feature annotated in ENA exon (ENA) 1 {"caption": "Genomic features", "key": "ena_features", "label_key": "[text_label] [display_label]", "multi_name": "Genomic features", "name": "Genomic features"} 17 gene feature annotated in ENA gene (ENA) 1 {"caption": "Genomic features", "key": "ena_features", "label_key": "[text_label] [display_label]", "multi_name": "Genomic features", "name": "Genomic features"} 18 mRNA feature annotated in ENA mRNA (ENA) 1 {"caption": "Genomic features", "key": "ena_features", "label_key": "[text_label] [display_label]", "multi_name": "Genomic features", "name": "Genomic features"} 20 ncRNA genes annotated in ENA ncRNA genes (ENA) 1 {"label_key": "[biotype]", "caption": "Genes", "colour_key": "[biotype]", "default": {"cytoview": "gene_label", "MultiBottom": "collapsed_label", "alignsliceviewbottom": "as_collapsed_label", "MultiTop": "gene_label", "contigviewbottom": "transcript_label", "contigviewtop": "gene_label"}, "name": "Genes", "key": "ensembl"} 759 CATH/Gene3D families. Gene3D 1 {"type": "domain"} 765 \N GOA annotation 0 \N 517 GO term derived transitively from a UniProt record UniProt-derived GO term 0 \N 273 The Gene Ontology XRef projection pipeline GO projected xrefs 0 \N 716 Gene encoding an enzyme annotated at the Plant Reactome. Plant Reactome 1 \N 743 HAMAP families. HAMAP 1 {"type": "domain"} 741 PANTHER families. PANTHER 1 {"type": "domain"} 762 InterPro2GO mapping, defined by InterPro. InterPro2GO mapping 0 \N 33 Alignments of EST clusters (Unigenes, Gene Indices) from Maize EST Cluster (Maize) 1 {"type": "cdna"} 489 RNA genes imported from miRBase . RNA genes 1 {"label_key": "[biotype]", "caption": "Genes", "colour_key": "[biotype]", "default": {"cytoview": "gene_label", "MultiBottom": "collapsed_label", "alignsliceviewbottom": "as_collapsed_label", "MultiTop": "gene_label", "contigviewbottom": "transcript_label", "contigviewtop": "gene_label"}, "name": "Genes", "key": "ensembl"} 752 Intrinsically disordered regions predicted by MobiDB lite. MobiDB lite 1 {"type": "feature"} 754 Coiled-coil regions predicted by Ncoils. Coiled-coils (Ncoils) 1 {"type": "feature"} 26 Percentage of repetitive elements for top level sequences (such as chromosomes, scaffolds, etc.) Repeats (percent) 1 \N 23 Percentage of G/C bases in the sequence. GC content 1 \N 740 Protein domains and motifs from the Pfam database. Pfam 1 {"type": "domain"} 761 Protein domains and motifs from the PROSITE profiles database. PROSITE profiles 1 {"type": "domain"} 748 Protein domains and motifs from the PIR (Protein Information Resource) Superfamily database. PIRSF 1 {"type": "domain"} 745 Protein fingerprints (groups of conserved motifs) from the PRINTS database. Prints 1 {"type": "domain"} 483 Repeats detected using the MIPS Repeat Database (REdat) using RepeatMasker. Repeats: REdat 1 \N 484 Repeats identified by RepeatMasker, using the Repbase library of repeat profiles. Repeats: Repbase 1 \N 490 RNA genes produced by filtering alignments of Rfam (release 12.2) covariance models. RNA genes 1 {"label_key": "[biotype]", "caption": "Genes", "colour_key": "[biotype]", "default": {"cytoview": "gene_label", "MultiBottom": "collapsed_label", "alignsliceviewbottom": "as_collapsed_label", "MultiTop": "gene_label", "contigviewbottom": "transcript_label", "contigviewtop": "gene_label"}, "name": "Genes", "key": "ensembl"} 31 Alignments of EST clusters (Unigenes, Gene Indices) from rice (Oryza) species EST Cluster (Rice) 1 {"type": "cdna"} 760 Protein domains and motifs from the PROSITE patterns database. PROSITE patterns 1 {"type": "domain"} 753 Low complexity peptide sequences identified by Seg. Low complexity (Seg) 1 {"type": "feature"} 34 Genomic alignments of Selaginella "Expressed Sequence Tags" (ESTs) from dbEST Selaginella EST 1 {"gene": {"do_not_display": "1"}, "type": "est"} 32 Alignments of EST clusters (Unigenes, Gene Indices) from Selaginella species EST Cluster (Selaginella) 1 {"type": "cdna"} 749 Structure-Function Linkage Database families. SFLD 1 {"type": "domain"} 751 Signal peptide cleavage sites predicted by SignalP. Cleavage site (Signalp) 1 {"type": "feature"} 744 Protein domains and motifs from the SMART database. SMART 1 {"type": "domain"} 43 Density of single nucleotide polymorphisms (SNPs), calculated by dividing the chromosome into 150 "bins" and counting the SNPs in each. (For very short chromosomes, e.g. MT, some SNPs contribute to multiple bins.) SNP Density 1 \N 746 Protein domains and motifs from the SUPERFAMILY database. Superfamily 1 {"type": "domain"} 750 Protein domains and motifs from the TIGRFAM database. TIGRFAM 1 {"type": "domain"} 755 Transmembrane helices predicted by TMHMM. Transmembrane helices 1 {"type": "feature"} 486 Tandem Repeats Finder locates adjacent copies of a pattern of nucleotides. Tandem repeats (TRF) 1 \N 482 tRNA models predicted with tRNAscan-SE (release 1.3.1). tRNA Models 1 {"type": "rna"} 491 RNA genes produced by filtering predictions from tRNAscan-SE v1.23. RNA genes 1 {"label_key": "[biotype]", "caption": "Genes", "colour_key": "[biotype]", "default": {"cytoview": "gene_label", "MultiBottom": "collapsed_label", "alignsliceviewbottom": "as_collapsed_label", "MultiTop": "gene_label", "contigviewbottom": "transcript_label", "contigviewtop": "gene_label"}, "name": "Genes", "key": "ensembl"} 515 UniParc mapping based on sequence checksums UniParc cross-reference 0 \N 27 Sequences from various databases are matched to Ensembl transcripts using Exonerate. These are external references, or 'Xrefs'. DNA match 0 \N 28 match Protein 0 \N 516 UniProt cross-reference derived transitively from a UniParc identifier UniParc-derived cross-reference 0 \N 518 Cross-reference derived transitively from a UniProt record UniProt-derived cross-reference 0 \N 521 Cross references to RefSeq nucleotide sequences, determined by alignment against the transcriptome with blastx. RefSeq transcripts 0 \N 520 Cross references to RefSeq peptide sequences, determined by alignment against the proteome with blastp. RefSeq peptides 0 \N 519 Cross references to UniProt Swiss-Prot (reviewed) proteins, determined by alignment against the proteome with blastp. UniProt reviewed proteins 0 \N 522 Cross references to UniProt TrEMBL (unreviewed) proteins, determined by alignment against the proteome with blastp. UniProt unreviewed proteins 0 \N