Monarch geneset OGS2.0

DPOGS203718
TranscriptDPOGS203718-TA735 bp
ProteinDPOGS203718-PA244 aa
Genomic positionDPSCF300010 - 1150756-1153398
RNAseq coverage41x (Rank: top 72%)
Annotation
HeliconiusHMEL0059517e-8867.70% 
BombyxBGIBMGA003691-TA5e-7673.84% 
DrosophilaCG5703-PA3e-6450.67% 
EBI UniRef50UniRef50_P194042e-6554.72%NADH dehydrogenase [ubiquinone] flavoprotein 2, mitochondrial n=94 Tax=cellular organisms RepID=NDUV2_HUMAN
NCBI RefSeqNP_001040535.11e-6554.19%NADH-ubiquinone reductase [Bombyx mori]
NCBI nr blastpgi|3387280304e-6555.19%PREDICTED: NADH dehydrogenase [ubiquinone] flavoprotein 2, mitochondrial-like [Equus caballus]
NCBI nr blastxgi|3387280306e-6355.19%PREDICTED: NADH dehydrogenase [ubiquinone] flavoprotein 2, mitochondrial-like [Equus caballus]
Group
Gene OntologyGO:00512871.2e-96NAD binding
GO:00551141.2e-96oxidation-reduction process
GO:00164911.2e-96oxidoreductase activity
KEGG pathwayecb:1000544458e-66 
 K03943 (NDUFV2)maps-> Huntington's disease
    Oxidative phosphorylation
    Alzheimer's disease
    Parkinson's disease
InterPro domain[15-240] IPR0020231.2e-96NADH:ubiquinone oxidoreductase, 24kDa subunit
[45-221] IPR0123363.1e-45Thioredoxin-like fold
Orthology groupMCL24934 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS203718-TA
ATGTTCAAAATTATTGATCTTATCAAGAATCCTCGATTGAATAGAAACATTTGTACTTCAGGAAGGTTATGGAGTGAAGAGCTTTTTGCTCACAGAGACAGTAAAGAAAACAATCCCAATACGCCATTTGACTTTTCTCAAGCAAATTTAAGGAGACTAACAGCTATTATTCAAAATTATCCCGAAGGGGCTCAGCGTTCTGCGCTTGGAGCAGCAATGGATATTGTTCAAAGGCAAATCGGATGGATACCCATATCAGCTATGCACAAAGTGGCCGATATTCTCAGTATTCCCCGTATGAGAGTATACGAATGGGCTACATTCTATACTATGAATAAAAGAAGATTCCGGGGAAAATTCAATGTGAAAGTTTGTATAACGACTCCTTGTATGCTGCGTGGATCGGACATCATTTTAGCAGCGGCAGAAGCGGCAACCCGCTGCCGTGTGGGAGGTCTCTCCAGTGACAAAATGTTCGGCGTGGATGTCGTGCAATGCCAGGGCGCGTGTGTCAATGCACCCGTTCTCGTTGTAGACGACGATTATTATGAAGATGTTACTGTATGTGATGTGAATGAAATTATACAGACTTTAAGAAATGGAGGTATACCACCTTGGGGCCCTCGCTCCGGTAGGACTTCCTGTGAACCAATAACAGGCCAAACGACCTTATGTGAATATCCTCCTCAACCAGGCTATGGTCTCCAACCATCTTTATGTGGCCGGTGCAATTAG

Protein sequence:

>DPOGS203718-PA
MFKIIDLIKNPRLNRNICTSGRLWSEELFAHRDSKENNPNTPFDFSQANLRRLTAIIQNYPEGAQRSALGAAMDIVQRQIGWIPISAMHKVADILSIPRMRVYEWATFYTMNKRRFRGKFNVKVCITTPCMLRGSDIILAAAEAATRCRVGGLSSDKMFGVDVVQCQGACVNAPVLVVDDDYYEDVTVCDVNEIIQTLRNGGIPPWGPRSGRTSCEPITGQTTLCEYPPQPGYGLQPSLCGRCN-