Monarch geneset OGS2.0

DPOGS215558
Transcript        DPOGS215558-TA   1137 bp
Protein           DPOGS215558-PA   378 aa
Genomic position  DPSCF300129 + 724949-729920
RNAseq coverage   1334x (Rank: top 10%)
Annotation
Heliconius      HMEL006171        6e-104  69.31%
Bombyx          BGIBMGA010687-TA  2e-180  92.65%
Drosophila      CG6439-PB         1e-153  71.32%
EBI UniRef50    UniRef50_Q9VD58   2e-151  71.32%  CG6439, isoform A n=23 Tax=Eumetazoa RepID=Q9VD58_DROME
NCBI RefSeq     XP_973953.2       6e-161  74.22%  PREDICTED: similar to CG6439 CG6439-PA [Tribolium castaneum]
NCBI nr blastp  gi|189241141      1e-159  74.22%  PREDICTED: similar to CG6439 CG6439-PA [Tribolium castaneum]
NCBI nr blastx  gi|189241141      6e-158  74.22%  PREDICTED: similar to CG6439 CG6439-PA [Tribolium castaneum]
Group
Gene Ontology    GO:0000287  1.5e-206  magnesium ion binding
                 GO:0016616  1.5e-206  oxidoreductase activity, acting on the CH-OH group of donors, NAD or NADP as acceptor
                 GO:0051287  1.5e-206  NAD binding
                 GO:0055114  1.5e-206  oxidation-reduction process
                 GO:0006099  1.6e-143  tricarboxylic acid cycle
                 GO:0004449  1.6e-143  isocitrate dehydrogenase (NAD+) activity
KEGG pathway     tca:662783  2e-160
                 K00030 (IDH3) maps to Citrate cycle (TCA cycle)
InterPro domain  [5-377]   IPR001804  1.5e-206  Isocitrate/isopropylmalate dehydrogenase
                 [50-375]  IPR004434  1.6e-143  Isocitrate dehydrogenase NAD-dependent
                 [49-377]  IPR024084  6.1e-106  Isopropylmalate dehydrogenase-like domain
Orthology group  MCL13977  Single-copy universal gene

Nucleotide sequence:

>DPOGS215558-TA
ATGTCTCTTATAACTAGAAACCTGTGTCGTACTCTAGTCCAGGGATCTCAGCATGTCAGCAAAGGTTTGCATACAAGTGCAGTAAACTCTGAAAAAAACGTATGTTTCGCGCCACTATCAATTAGCAGCCTGCAAACAAGCAAGGAAGGTCGCATCAAGTGTACTCTTATACCGGGAGACGGTGTTGGTCCAGAGCTTGTGTATTCGGTACAGGAGGTGTTTAAGGCGACCAGCATTCCTGTTGATTTTGAATCATTCTTTTTCTCTGAAGTTAATCCAACATTGAGTGCGCCTTTAGAAGATGTTGTTAGCTCAATTGCTAGGAATAAGATATCTACCCCGGACTTCTCCCACACTGGTGAACTTCAGACGCTCAACATGAAGCTCCGTAATGCCTTGGATCTGTACGCTAATGTGGTGCATGTCAAGTCACTACCTAATGTCAAATGCAGACACACAGATGTTGATTGTATCATCATAAGAGAACAAACTGAAGGGGAATACTCTGCACTGGAACATGAATCGGTTCCCGGTGTTGTTGAATGTCTCAAGATAATAACGGCTGCTAAGTCTGAACGTATAGCTAAATTCGCTTTTGACTACGCGGTCAAGATGCGCCGTAAGAAGGTCACGGCTGTGCACAAGGCTAACATCATGAAGCTGGGCGACGGATTGTTCCTGAGGAGCTGTGAGGAGATGGCAAAATTATATCCAAGGATACAGTTTGAGAAGATGATTGTTGACAATTGCACGATGCAAATGGTCTCCAACCCGAACCAGTTTGATGTGATGGTGACACCCAACTTGTACGGCAACATAGTGGACAATCTGGCCAGCGGTTTGGTTGGTGGAGCCGGGGTGGTGGCTGGAGCCTCATACAGCGCTGACTGTGCTGTGTTCGAACAGGGTGCTCGTCATATATTCTCTGGTGCTGTCGGTAAGAACATCGCCAATCCGACAGCTATGCTTCTATGCTCGGCCAATTTGCTGTCTCACGTCAATCTGCACTCCTATGCTGATATGATCAAGAACGCTATCAATAAAGTTCTAAAAGACGGCAAGGTGAGAACAAAGGATTTGGGCGGACAGTCCACAACAAAGGACTTCACCAACGCCATCATACACTGCCTCGCTTAG

Protein sequence:

>DPOGS215558-PA
MSLITRNLCRTLVQGSQHVSKGLHTSAVNSEKNVCFAPLSISSLQTSKEGRIKCTLIPGDGVGPELVYSVQEVFKATSIPVDFESFFFSEVNPTLSAPLEDVVSSIARNKISTPDFSHTGELQTLNMKLRNALDLYANVVHVKSLPNVKCRHTDVDCIIIREQTEGEYSALEHESVPGVVECLKIITAAKSERIAKFAFDYAVKMRRKKVTAVHKANIMKLGDGLFLRSCEEMAKLYPRIQFEKMIVDNCTMQMVSNPNQFDVMVTPNLYGNIVDNLASGLVGGAGVVAGASYSADCAVFEQGARHIFSGAVGKNIANPTAMLLCSANLLSHVNLHSYADMIKNAINKVLKDGKVRTKDLGGQSTTKDFTNAIIHCLA-
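
The record's transcript and protein lengths can be cross-checked against each other: 378 residues plus one stop codon should account for 3 x 379 = 1137 bp. The following is a minimal sketch of that check, not part of the original record. It assumes the standard genetic code, assumes the trailing "-" in the protein sequence marks the stop codon, and assumes the genomic coordinates are 1-based and inclusive. The function names (`translate`, `expected_cds_length`, `genomic_span`) are hypothetical helpers introduced here for illustration.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds, stop_symbol="-"):
    """Translate an in-frame coding sequence; stops are written as stop_symbol."""
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


def expected_cds_length(protein_aa):
    """Coding length in bp for a protein of protein_aa residues plus one stop codon."""
    return 3 * (protein_aa + 1)


def genomic_span(start, end):
    """Span in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end - start + 1


# First 21 and last 15 nucleotides of DPOGS215558-TA, copied from the record.
first_codons = "ATGTCTCTTATAACTAGAAAC"
last_codons = "CACTGCCTCGCTTAG"

print(translate(first_codons))        # MSLITRN
print(translate(last_codons))         # HCLA-
print(expected_cds_length(378))       # 1137
print(genomic_span(724949, 729920))   # 4972
```

Under these assumptions the translated ends match the start (MSLITRN) and end (HCLA-) of DPOGS215558-PA, and the expected coding length agrees with the listed 1137 bp. The genomic span of 4972 bp is longer than the transcript, which would be consistent with the gene containing introns, though the record itself does not list an exon structure.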