Monarch geneset OGS2.0

DPOGS204223
TranscriptDPOGS204223-TA3138 bp
ProteinDPOGS204223-PA1045 aa
Genomic positionDPSCF300046 - 734794-746244
RNAseq coverage819x (Rank: top 16%)
Annotation
HeliconiusHMEL0151410.077.60% 
BombyxBGIBMGA007502-TA0.082.48% 
Drosophila% 
EBI UniRef50UniRef50_Q134230.058.46%NAD(P) transhydrogenase, mitochondrial n=372 Tax=root RepID=NNTM_HUMAN
NCBI RefSeqXP_970382.10.068.11%PREDICTED: similar to nadp transhydrogenase [Tribolium castaneum]
NCBI nr blastpgi|910836310.068.11%PREDICTED: similar to nadp transhydrogenase [Tribolium castaneum]
NCBI nr blastxgi|1571330170.070.18%nadp transhydrogenase [Aedes aegypti]
Group
Gene OntologyGO:00506613.4e-196NADP binding
GO:00160213.4e-196integral to membrane
GO:00551143.4e-196oxidation-reduction process
GO:00087503.4e-196NAD(P)+ transhydrogenase (AB-specific) activity
GO:00159929.1e-178proton transport
GO:00164912.5e-48oxidoreductase activity
KEGG pathwaytca:6589420.0 
 K00323 (NNT)maps-> Nicotinate and nicotinamide metabolism
InterPro domain[582-1039] IPR0121363.4e-196NADP transhydrogenase, beta subunit
[41-549] IPR0045719.1e-178NAD(P) transhydrogenase, alpha subunit
[185-348] IPR0076982.5e-48Alanine dehydrogenase/PNT, C-terminal
[42-175] IPR0078862e-44Alanine dehydrogenase/PNT, N-terminal
Orthology groupMCL17748 Patchy
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS204223-TA
ATGTGCGGATGTGTGAGGGTACTCCGCCTTGGCAGTCCACTAAAAAAATCATGGCAACGCAGATTGTTTTCTTCATCTTCTTCGCCGCCAACGCAAGGAGTGCCATACACCAAACTTAGTGTCGGAATTCCCAAAGAGATATGGCAAGATGAAAGGAGAGTTGCCATAGTACCAGCTGTAGTAAGTAAATTAGTTAAAAAAGGTTTTACAGTTAATGTAGAAGAAAATGCTGGTACATTAGCAAATTTTCCAAATAAGACTTATGAAGAAGTTGGAGCTAAAATAACTAGTGTAAAGGACGCCTACGGCTCAAATATAATTCTTAAAGTACGTCCGTTGGCTGAAAGCGAACTTCAAAATGTTAACGAGGAATCAACGCTTATATCATTTTTCTACCCAGCACAAAACCAAGCACTTATACAAAAATTGGCTGCTAAAAAGGTGAATGCATTTGCAATGGACTGCATTCCTCGGATAAGCCGTGCGCAGGTGTTCGACGCGTTGAGCTCTATGGCGAACGTTGCTGGCTATCGAGCTGTCATTGAAGCCGCAGCACATTTCCCACGTTTCTTCTCCGGCCAGATGACGGCAGCGGGTCGCGTGCCGCCATGCCGCGTGTTGGTGGTTGGCGGAGGGGTAGCGGGGTTGGCGGCCGCGGCCCAGGCGAGATGTATGGGGGCCGCAGTCCGGGCCTTCGACACACGACCGGCTGTGAGGGAACAGATCGAGAGTCTCGGTGCACAGTTCATTACTATGGAAGTTAAGGAGGAGGGGGCTGGTGCTGGTGGGTACGCAAAGGAAATGAGTGAGGAGTTCCTTCAGGCGGAGCGTGCTTTGTTGGGACGGGAGGCTCGGAATTCAGACGTCGTGATCAGCACAGCACTCATACCAGGGAAACCGGCGCCGCTGCTCATATTAGAGGATGCTGTCAAAGATATGGCTCCTGGCAGCGTGATAGTTGATCTAGCCGCTGAGATGGGTGGAAACATTGAGACGACCACTAAGGGCAAGGTGACCAGGGTTCATGACGTCACACACATCGGCCTCACAGACCTACCAAGTCGAATGCCCGCACACGCCTCCACACTCTACGCCAACAATATTTCTGCATTCTTATTAAGTTTAGGTACTAACGATCACTTCCACATCAATCTGGAGGATGAAGTGACTCGTGGGGCGATAGTCCTTAAAGCTGGTGAATTACTATGGCCACCGCCGCCCGCACCCTTGGTTGCACCCGACGCGGCCCCCAAAACTGTAACCCCTGTCAAGGTCGAGCCTCCCAACCCCTTCAATGAAACCTTGAAGGATACCTTCTTATATTCTACCGGTCTGGCAAGTCTTATCGGTCTCGGTATGGCATCGCCGAATCCGGCCTTCACCACTATGACCACCACCTTGGCTTTATCTGGTGTCGTGGGTTACCACACGGTGTGGGGCGTGGTGCCGGCGCTGCACTCTCCTTTGATGTCCGTCACTAACGCCGTGTCGGGCATTACGGCTGTGGGTGGACTACTGTTGATGGGAGGAGGATATCTGCCAGAAACACCTGTACAGTGGCTAGCGAGTACAGCGGCTTTGATCTCCTTTGTCAACGTATTCGGCGGGTTCATGGTCACACAGCGTATGTTGGATATGTTCAAAAGGCCAGGTGATCCGCCAGAGTATGGATATCTGTACGCTATACCTGCTGCCGCGCTTTTGGGAGGATACATCACAACAGCGATGCAGGGTTACCCTGAAGTCCACCAGATGGCGTACCTAGCTTCGTCGTTATGCTGCGTCGGAGCACTCGCCGGCCTGAGCTCACAGACGACAGCCAGGAAGGGAAACTATTTGGGAATGATTGGTGTATCCGGCGGTATAGCGGCCACACTGGGAGCATTGACTCCAACATCCGAAGTATTGGCGCAAATGGTTGGCGTGGCGGGCATTGGCGGTCTACTTGGTGGTGTCATCGCTAAGAAAATTGAAATCACTGATTTGCCACAACTTGTGGCTGGATTCCACAGCTTGGTGGGCATGGCCGCTGTATTAACATGTCTAGCGACGTACATGCACGACTTCCCCGCCATGGCGCTGGACCCCACCGCCGCCACGCTCAAGACGTCTCTCTTCCTCGGCACATACATCGGTGGAATAACATTCACTGGGTCGTTAGTGGCTTACGGTAAACTTCAAGGCGTGTTGTCCTCGGCCCCACTATTACTGCCGGGTCGTCATGCTCTGAACGCGGCGCTGGCCACGGGAGCCCTGGGCTGTGGCGGAGCCCTGCTCGCCTTCCCCGAAGCCCCCGGCCTGCCGCTACTGTCCGCCGCCGCCGTCCTCAGCGGCATCCAGGGCCTCACACTCACATCTGCTATTGGTGGTGCTGACATGCCGGTGGTGATCACAGTCCTGAACAGCTACTCCGGCTGGGCGCTGTGTGCTGAAGGGTTCATGTTGAACAACTCCCTCATGACCATCGTCGGCGCACTCATCGGCAGCTCCGGAGCCATTCTATCTTATATAATGTGCAAGGCTATGAATCGTTCGTTGCCGAATGTAATCCTAGGTGGGTACGGTGTAACGGGGTCCGGATCAGCTCGCCCGGAAGGCGCCACCCACACCGAGATGAACGTCGACAGCGTAGCAGACCTCGTTCACCGCGCCTCCTCCATCATTATAACTCCCGGTTATGGTCTGTGTGTGGCCAAAGCTCAGTATCCCATCGCCGAATTGGTGGACATTCTTAAAGGCATCGGAAAAAAAGTGCGCTTTGCTATACATCCAGTTGCTGGACGTATGCCCGGTCAACTGAACGTGCTGCTCGCTGAAGCCGGTGTGCCCTATGACGACGTGTTCGAGATGGAGGAAATCAACGATGAATTCCCGGAAACTGACTTGGTCTTAGTTATAGGCGCCAACGACACCGTGAACAGTGCTGCTGAGGACGACCCGGAGTCTCCCATAGCCGGCATGCCGGTGCTCAAAGTGTGGAAGGCGAACCAAGTGGTAGTGATGAAGAGGTCTATGGGTGTCGGCTACGCGGCAGTTGATAACCCCATATTCTACAACCCCAACACCGCCATGTTGTTGGGAGACGCCAAGAAGACTTGCGACGCACTTCTCGACAGAATCAAACATCTCGCTGCATAA

Protein sequence:

>DPOGS204223-PA
MCGCVRVLRLGSPLKKSWQRRLFSSSSSPPTQGVPYTKLSVGIPKEIWQDERRVAIVPAVVSKLVKKGFTVNVEENAGTLANFPNKTYEEVGAKITSVKDAYGSNIILKVRPLAESELQNVNEESTLISFFYPAQNQALIQKLAAKKVNAFAMDCIPRISRAQVFDALSSMANVAGYRAVIEAAAHFPRFFSGQMTAAGRVPPCRVLVVGGGVAGLAAAAQARCMGAAVRAFDTRPAVREQIESLGAQFITMEVKEEGAGAGGYAKEMSEEFLQAERALLGREARNSDVVISTALIPGKPAPLLILEDAVKDMAPGSVIVDLAAEMGGNIETTTKGKVTRVHDVTHIGLTDLPSRMPAHASTLYANNISAFLLSLGTNDHFHINLEDEVTRGAIVLKAGELLWPPPPAPLVAPDAAPKTVTPVKVEPPNPFNETLKDTFLYSTGLASLIGLGMASPNPAFTTMTTTLALSGVVGYHTVWGVVPALHSPLMSVTNAVSGITAVGGLLLMGGGYLPETPVQWLASTAALISFVNVFGGFMVTQRMLDMFKRPGDPPEYGYLYAIPAAALLGGYITTAMQGYPEVHQMAYLASSLCCVGALAGLSSQTTARKGNYLGMIGVSGGIAATLGALTPTSEVLAQMVGVAGIGGLLGGVIAKKIEITDLPQLVAGFHSLVGMAAVLTCLATYMHDFPAMALDPTAATLKTSLFLGTYIGGITFTGSLVAYGKLQGVLSSAPLLLPGRHALNAALATGALGCGGALLAFPEAPGLPLLSAAAVLSGIQGLTLTSAIGGADMPVVITVLNSYSGWALCAEGFMLNNSLMTIVGALIGSSGAILSYIMCKAMNRSLPNVILGGYGVTGSGSARPEGATHTEMNVDSVADLVHRASSIIITPGYGLCVAKAQYPIAELVDILKGIGKKVRFAIHPVAGRMPGQLNVLLAEAGVPYDDVFEMEEINDEFPETDLVLVIGANDTVNSAAEDDPESPIAGMPVLKVWKANQVVVMKRSMGVGYAAVDNPIFYNPNTAMLLGDAKKTCDALLDRIKHLAA-