Monarch geneset OGS2.0

DPOGS200448
Transcript: DPOGS200448-TA, 1167 bp
Protein: DPOGS200448-PA, 388 aa
Genomic position: DPSCF300260 - 288625-292791
RNAseq coverage: 5649x (Rank: top 2%)
Annotation
Heliconius: HMEL006298, 3e-166, 81.89%
Bombyx: BGIBMGA011412-TA, 0.0, 87.11%
Drosophila: CG5028-PA, 2e-151, 69.12%
EBI UniRef50: UniRef50_D2Y061, 0.0, 86.86%, Isocitrate dehydrogenase n=7 Tax=Pancrustacea RepID=D2Y061_BOMMO
NCBI RefSeq: NP_001165386.1, 0.0, 86.86%, isocitrate dehydrogenase [Bombyx mori]
NCBI nr blastp: gi|284813561, 0.0, 86.86%, isocitrate dehydrogenase [Bombyx mori]
NCBI nr blastx: gi|284813561, 0.0, 86.86%, isocitrate dehydrogenase [Bombyx mori]
Group
Gene Ontology: GO:0000287, 3.4e-196, magnesium ion binding
 GO:0016616, 3.4e-196, oxidoreductase activity, acting on the CH-OH group of donors, NAD or NADP as acceptor
 GO:0051287, 3.4e-196, NAD binding
 GO:0055114, 3.4e-196, oxidation-reduction process
 GO:0006099, 1.3e-150, tricarboxylic acid cycle
 GO:0004449, 1.3e-150, isocitrate dehydrogenase (NAD+) activity
KEGG pathway: tca:662903, 3e-163
 K00030 (IDH3) maps-> Citrate cycle (TCA cycle)
InterPro domain: [42-389] IPR001804, 3.4e-196, Isocitrate/isopropylmalate dehydrogenase
 [54-384] IPR004434, 1.3e-150, Isocitrate dehydrogenase NAD-dependent
 [51-386] IPR024084, 1.6e-111, Isopropylmalate dehydrogenase-like domain
Orthology group: MCL14031, Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS200448-TA
ATGGCAGTGAGGTTGCTCTCGAAAGTGAAATGCCTGCCTGGGATACAGGGAGTGGTATACGCTAGTGGGGCGGCTGCTCCAGCTCAGCTTTCAGATTTTGAGTTACAGCATAAGACGCCAGTAATTCGTAAGCAGAAGAACATCCCCATAGCTCAGTACGGAGGCCGTCATGCCGTCACCATGCTCCCTGGTGGTGGAATCGGCCCTGAATGCATGGGCTACGTCAGGGAAATTTTCAAGTACATCGGTGCCCCCATAGACTTTGAACTTGTGAATATTGATCCTAATGTAGACAATGATGACGACGTCCAATATGCCATCACCACCATCAAGAGGAATGGTGTCGGACTTAAGGGTAATATTGAGACTAAGAGCGAGGCAGCGTACGTAACTTCACGTAATGTTGCGTTACGTAACGAGTTAGACATGTACGCATATGTTTTGAACTGCAAATCATTTCCCGGAGTCAGCACCAGGCACAAGGATATTGACATCGTTATTATAAGACAGAACACAGAAGGTGAATATGCCATGTTGGAACATGAATCTGTTAGGGGGGTGATTGAATCGATGAAGGTGGTCACAGCGAGCAACTCGGAGAGGGTTGCCAGATTCGCTTTTGAATTTGCCAAGAGGAATGGAAGGAAGAAGGTAACGACTGTCCACAAAGCGAATATCATGAAGCTATCAGATGGGCTGTTCCTGGAGACATCCCGTCGTTTGGCTCAAGAGTATCCGGACATAGAACATAATGATATGATCATTGACAACTGTTGTATGCAACTTGTTGCCAGGCCGCACCAGTTTGACGTGATGTTGATGACAAATCTGTATGGATCAATTGTCTCTAACGTGGTTTGTGGTCTACTCGGAGGAGCTGGTTTACTCTCCGGGAGGAATTATGGTGACAACTACGCAGTCTTTGAACCTGGCACTAGGAATACTGGTACAGCCATAGCTGGCAAGAACATTGCTAACCCAATAGCCATGATAAACGCCTCGGTGGACATGTTGGAGCACCTCGGACACCATTATCATGCCGGTTTGATCAGGAGAGCGTTGGATAAAACTATTAATACCGATAGAGTGCTCACCCCTGACTGCGGAGGAACAGCCAGTTCCAGTGAAGTGGTTGACAGCATCATGCAGAATATTGGCCGCTGCTAG

Protein sequence:

>DPOGS200448-PA
MAVRLLSKVKCLPGIQGVVYASGAAAPAQLSDFELQHKTPVIRKQKNIPIAQYGGRHAVTMLPGGGIGPECMGYVREIFKYIGAPIDFELVNIDPNVDNDDDVQYAITTIKRNGVGLKGNIETKSEAAYVTSRNVALRNELDMYAYVLNCKSFPGVSTRHKDIDIVIIRQNTEGEYAMLEHESVRGVIESMKVVTASNSERVARFAFEFAKRNGRKKVTTVHKANIMKLSDGLFLETSRRLAQEYPDIEHNDMIIDNCCMQLVARPHQFDVMLMTNLYGSIVSNVVCGLLGGAGLLSGRNYGDNYAVFEPGTRNTGTAIAGKNIANPIAMINASVDMLEHLGHHYHAGLIRRALDKTINTDRVLTPDCGGTASSSEVVDSIMQNIGRC-
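
The transcript and protein lengths in this record can be cross-checked with simple codon arithmetic. The sketch below assumes the listed transcript is coding sequence only, running from the ATG start codon through the terminal TAG stop codon, and that the trailing "-" in the protein sequence marks that stop. The function name expected_protein_length is hypothetical and is not part of any OGS2.0 tooling.

```python
def expected_protein_length(cds_bp: int, has_stop_codon: bool = True) -> int:
    """Return the protein length (aa) implied by a coding sequence length (bp).

    Assumes the sequence is coding-only (no UTRs), so its length must be
    a multiple of three. If the final codon is a stop, it encodes no residue.
    """
    if cds_bp % 3 != 0:
        raise ValueError(f"{cds_bp} bp is not a whole number of codons")
    codons = cds_bp // 3
    return codons - 1 if has_stop_codon else codons


# DPOGS200448-TA is listed as 1167 bp: 389 codons, one of which is the stop.
print(expected_protein_length(1167))  # 388
```

The result agrees with the 388 aa listed for DPOGS200448-PA. Counting the stop position as well gives 389, which would explain why the InterPro range [42-389] runs one position past the stated protein length.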