Monarch geneset OGS2.0

DPOGS207254
TranscriptDPOGS207254-TA1206 bp
ProteinDPOGS207254-PA401 aa
Genomic positionDPSCF300008 - 867100-868305
RNAseq coverage256x (Rank: top 41%)
Annotation
HeliconiusHMEL0073630.091.28% 
BombyxBGIBMGA012121-TA0.087.53% 
DrosophilaCG17712-PA7e-13158.55% 
EBI UniRef50UniRef50_Q9Y1531e-12858.55%BcDNA.GH03377 n=27 Tax=Neoptera RepID=Q9Y153_DROME
NCBI RefSeqXP_970276.14e-14460.10%PREDICTED: similar to oxidoreductase [Tribolium castaneum]
NCBI nr blastpgi|3323757111e-14360.40%unknown [Dendroctonus ponderosae]
NCBI nr blastxgi|3323757112e-13860.40%unknown [Dendroctonus ponderosae]
Group
Gene OntologyGO:00054887.2e-27binding
GO:00164911.5e-14oxidoreductase activity
KEGG pathway 
InterPro domain[5-133] IPR0160407.2e-27NAD(P)-binding domain
[5-112] IPR0006831.5e-14Oxidoreductase, N-terminal
Orthology groupMCL11610 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS207254-TA
ATGCTTCCTGGTATCGGTGTATTCGGAACTGGTTCTGTAGCTAAAGTTTTGGTGCCTTTCCTCAGGGAGAAGGGATTTCCAGTCGAAGCTATATGGGGTGTGACGGTTCAAGACGCTGAAAATGCGGCAAAAGAATTAAATATACCTTTCTACACTAATAAAATAGACGACGTTTTATTAAAAAAAAATGTGAGCCTGGTATTTATTGTTTGCGCTCCTAATTTACATGCGCAAATATCAATGAAAGCATTGGGAATAGGTAAACACGTCGTATGTGACAAGCCCGCCGGTCTTTGCCAAGCCGAAGCCTTAAAAATGGTGCGTGCAGCTCAGTATTATCCTACACTAATATCTATCATCAATCATTCCCTAAGATTCTTACCAGCATTTAGCCATATGAGAAAATGTATTATAGATGGTTATTTAGGCAATCCGGGTGACTTGACACTTATTGATGTGAGAGTACAAATGGGATCTCTACTAGGAGAAACATATAATTGGCTATGTGATGACACCATGGGTGGAGGTACGCTCACTTTAGTGGGAAGTCATGTGATAGATTTAGTTTCATATCTCAGTGGGCAGAAAGTTGTGAAAGTACATGGAGTTTTACGAACATTTGTGGATGAGACATCAAAAGTTAATGGCATAAGAAGGATCACAGCTCCTGATTTTTGTACATTCCAATTACAAATGGACAAAGGTTTATTGGTCACAGCAACATTGAACAATCACTTGCCGGGACCTTGTTTTAATCAAGAAATTTATGTTTGCAGTAAGAAGGGTTATTTAGTAGTTCGAGGAGGTGATTTACATGGACGCTTGTATAAGAGTAACTCTAAAAGTGCTTTAGAAGATGAAGGTAAAAGACCTCATGATAAAGAAGAGGTAATTTATGTAGATATTGAAGACTTGAGTTGTGCATCGAGTGTTATCCCAAAACCATATATTAAAGGTCTTTGTAAAATGATAAGTGCGCTTAAGGAAGCATTTCTTCCTGTAAAGGAACAGATGGATTGGATTAAAGAGCCAGTAAGGACCGCAGCCACCTTTGAAGATGGGCAGAGAGTTCAGGCGACCATGGAAGCATTACGCCAATCTGATGAAGATGGTTGCTGGAAAACAGTCCAACTTCTCACTGAACCACCTGACCCGAATCCAGCTTTATCAGCAGCTGTAAGACGCACGGCCATATCATTGCAATAA

Protein sequence:

>DPOGS207254-PA
MLPGIGVFGTGSVAKVLVPFLREKGFPVEAIWGVTVQDAENAAKELNIPFYTNKIDDVLLKKNVSLVFIVCAPNLHAQISMKALGIGKHVVCDKPAGLCQAEALKMVRAAQYYPTLISIINHSLRFLPAFSHMRKCIIDGYLGNPGDLTLIDVRVQMGSLLGETYNWLCDDTMGGGTLTLVGSHVIDLVSYLSGQKVVKVHGVLRTFVDETSKVNGIRRITAPDFCTFQLQMDKGLLVTATLNNHLPGPCFNQEIYVCSKKGYLVVRGGDLHGRLYKSNSKSALEDEGKRPHDKEEVIYVDIEDLSCASSVIPKPYIKGLCKMISALKEAFLPVKEQMDWIKEPVRTAATFEDGQRVQATMEALRQSDEDGCWKTVQLLTEPPDPNPALSAAVRRTAISLQ-