Monarch geneset OGS2.0

DPOGS203622
TranscriptDPOGS203622-TA3708 bp
ProteinDPOGS203622-PA1235 aa
Genomic positionDPSCF300063 + 639457-651387
RNAseq coverage991x (Rank: top 13%)
Annotation
HeliconiusHMEL0088850.073.56% 
BombyxBGIBMGA007293-TA0.067.87% 
DrosophilaCG17018-PF3e-6428.39% 
EBI UniRef50UniRef50_D6WWB55e-13431.02%Putative uncharacterized protein n=2 Tax=Tribolium castaneum RepID=D6WWB5_TRICA
NCBI RefSeqXP_972723.21e-14831.51%PREDICTED: similar to limkain b1 [Tribolium castaneum]
NCBI nr blastpgi|1892401012e-14731.51%PREDICTED: similar to limkain b1 [Tribolium castaneum]
NCBI nr blastxgi|3454841841e-15933.03%PREDICTED: LOW QUALITY PROTEIN: limkain-b1-like [Nasonia vitripennis]
Group
Gene OntologyGO:00001661.5e-07nucleotide binding
KEGG pathway 
InterPro domain[92-228] IPR0211394.8e-10Domain of unknown function DUF88
[444-530] IPR0126771.5e-07Nucleotide-binding, alpha-beta plait
Orthology groupMCL13250 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS203622-TA
ATGACAATGAGCAAAATGTATCCAACGAGAGGTGTAGACATTTATGTGAGAAATAAAAGTGCAAGAAGTCATTCCGTTCAGGGTCTAGCCACCAATATTACTAGCCCTGCCATGAAAATACCATTACCTCCTCGTTTATGGCTAACAGATGTAGAAGATGAGTCGTCAGATGATTTAAGCCATAGTTCACTAGAAGATGGGACCATTCCAGTTGAAAGGACGAGAATAAGATCACGTTTTCGTCACAAACCGAGGGCATCTGTTAATGTGCCTATTGGAATATTTTGGGATATAGAGAATTGTCCGGTGCCCAGAGGCTGTTCAGCGATTAATGTAGTAGCAGCGATCAGATCTAAGTTCCTCACAGGCCGTCGGGAAGCCGATTTTGTAGTTGTGTGTGATGTGAGAAAGGAAACACCCCAAAAGCTTCAGGAACTGAATGATGCTCAAGTATGTAGCAATACCTTGACATTTAAAATATTAGCTGCCGATGAGAAGCTAAGACAATGTATGCGTCGCTTCGGAGAGCTTCACTCAAGTCCGGCAGCCCTATTGCTTATATCGGGAGACATCAATTTTGCAGCGGATCTCAGCGATTTCAGACATAGGAAGGGTATGGAAGTTATATTGGTGCATAGACAGAATACCTCATCTGCTTTAATAGCCTGCGCCTCCTCTCATTACTCATACAACGAACTTACAGTGAATATACCGAGGAATGTGAATGTAAGTGGGGTGGAGGAAGAGGATCCTAGCCGTGAGATGGAGGTCATTAACCTGCCTATGGACCAGCCGCCGCAGAGGGTTTCCAGGCGTCTGCTGCGACTGGCCGACAACTGTGGGGGCAAGGTGATGAGGGTTGGCGCTGGGACGGCTCTCCTCAGGTTCCCTACACCTGATCACGCTTCACGAGCTCTAAAACGTATGGACGGCGAGGATGTTTTCGGTCGTAAAATTAGTACGCGTTACGCTCGAACGTTCCAATCGACGTATTCCAGCGACGAGGGTTACAGCACCGGCCAGAGGCCGCAGTTTGTTGCACCTGTCGTTCAGCCCCAGCCTCCTCCCCCTCCCCCTCCGTCCGAGCCTCCCCGCAGCGCTTCTAGCGACTGGGCTCTGGCGCTGCAGCAGCTGCCAGCCCCAACACCACCACCACTGCAGTTCTGCCCACCACCAGCACCAAAACCGCGCAAGATTAGAGGAACTCATGGGTCAGCCAGCCTGGACCGGTCTGGTTGCTCATCCACCAACAGTGATGACCGCCGTGAGAACAGTCATAGTCACAGTCACAGCCGAGCAGTGTCTCCTTGGAACTCCTCGGTCAGCGAGCACAGCGAGGATGACACTGACCTCACCGTCTCCAACCTACCACCTTATGAACCACCAGTATTACAGGAAATGCTGACGAAACTGTTCAATCAGTACGTACCGGTGGTTCGGGTGTCAGTGTGGGCGGCGGGCGAGGGTCCTCTGGCTACGGTCGTAGTACGATCTGAATGGGACGCCCGTCTAGCTATAGCCCGCGTCCACAAACGAAGATTGGACAACAATTGGGCTGGGAGACGATTGGAATTATCCCTAGGAAGACCATCGCCGGCTCCCAATCTAGACGTCCTCAGGTCTAGACTGCGAGCTATACTCCTAGACCAGAAAAATTTCAGTCTACCGCTATTGAGACTGCGAGATGCGTACGCTAGTAGACATTGCTGTGCGCTGACTACATCAGACATAGCTAAAGTCAAGGATACGGTTGTCATCCATGAAGGTTTCGGACGTATGGTCCAATTGGTCGATCACACGCCGGTCGTGAACACGGAAACGGAAGAGGCGCCATGGAAGTGTCATATACACGCTGTACTGAATACTAGCCACGAGGACGGAAGCCGTATCTTGCAACCCGTCTACATGGATATAGCGGTTCTGGCTAAGAATACTCAAGTGCTGTTGGAGAATCACGGGGGTATACTGCCCTTGTTGAGTTTCGTGGAATGCTACGAGGCAATGTTCCCGTCGTTAGTAACAGACAATCGTCGCGGCGTCGCCCTGGAACTGCTACTGCGAAGTATACAGACATTGGAAGTTAAAGACAGTCCCTCGAGACATTTAACCTGGAAAACATCCACAGACTCGCCCTCGCAGAGTTATATCAGCGACACTTCTCGCAGCAGCGACCGCGACCGTCCACGCACAGCGCCCGCCTTGGAACCGATGCTGGCGTTGTTTGAGAGAGAGTTGGTGGATTTGTTGAGAACAGCCCCAAGATGCTCAATACCGTTCAGCAAGTTGATACCCGCGTTCCATCATCATTTCGGTCGCCAGTGTCGTGTGGCTGATTACGGCTTCACCAAACTACCAGATCTGTTATCAGCTCTGGGTAACACTATCGTGGTTCTGGGCTCAGGGTCATATCGTGTCATAACGATCTCATCCGCCGCCCAGGGGAGGCGTTGGACGTCGGATTTGTTGAAAATATTGAAGGCCCAACCCGGCCGGGTCATTCACATCCACGATCTGCCGCAGTTATACCAGTCGACCATCGGCAGACCTTTCAGCACCGTCGATTATGGAGTGTGCACCATGGACGAGCTGATGGAGAAGGTGTCCCCTCAGAGCGTCATAGTCTCACCAGAGGGCACCATCTCACTCCCACGGAGAACCCCGACGCCGGAAGAACGAACGAGAACTGTACAGTTCGCGGTGCAAGCGGTGGAGCTGCTTTGCTACACTCCGAACCTCAGAATGGAGTTTTCGCGTTTCGTGCCGGCTTATCACGCGCACTTCGGGAGGCAGTTACGTGTTGCGCATTACGGATGTGTTAAATTGGTGGAACTGTTCGAATTGATACCGGAAGCAGTGAGTGTGTACTGCGAGTCTTCCGGTGAGAGGAGCGTGCGGCTGGGGCTGCAGACTGCTACGGCCGTGATGGCGCAGAGATTGAAGAGCCTGGCGCCCGTGTCGATCACCACCTTCCCCTCGCGCTACGCCGCCCAATTCGGCGCCCCGCCCCTACCTGATTGCTTAGACGCTCCTAATCTCGAATCCCTTGTATACGCAGCCGGTGGTTTCATTGAAGGTGATATAATTCACGTCGGTGATTCATCTCAATGGGCCAATTCGGCGCTGTCAGCGTGCGCCGTGCTGTCGGCCGACCGCAGCGTCGCCAGGGGATCCACCGAGGAATATTTCATGACGGCATTCCGCTCGCTGTATGGCATCGAGCCGGACGTGAGCAATCTAATGATGTCCGGTGTCCTCGACGTGTCGGAGCGTCACGTGTGCCTGACTAACACCTGGCGTACCGTGTGGCGTGTCGCACAAATCTTGTCCGATTACCCGGGGAACGTGGCGGCAGTCGAGATATTCATCGAGTACTCAAAGAGATACGGCCCGTCGTTCCCTAACGCGGAGCTCGGTATGGATGCTATGAACACGTTTCTGAGTAAGCATCGCAGTGTTTTCAATACTAGTGATGGCCGCTGGGGTCTCTCGGCCGGCGTGACATTACCCAGACCGGAGTATTCACTACGCGCTGAAGATTACTCACTTCACGATACACCACCGGGGCAGAAGGGATCCCGCGTCTTTGAATCTCCGAAGACGAATATTTGGAGTTCGCCGCCGGCCAGCGCCCTACCAACACCGACTGCTCTACTCAACCACGATAATAGACGTCGCACCCGTCTGGCAGCACAGTTTGATGCAGCGTAA

Protein sequence:

>DPOGS203622-PA
MTMSKMYPTRGVDIYVRNKSARSHSVQGLATNITSPAMKIPLPPRLWLTDVEDESSDDLSHSSLEDGTIPVERTRIRSRFRHKPRASVNVPIGIFWDIENCPVPRGCSAINVVAAIRSKFLTGRREADFVVVCDVRKETPQKLQELNDAQVCSNTLTFKILAADEKLRQCMRRFGELHSSPAALLLISGDINFAADLSDFRHRKGMEVILVHRQNTSSALIACASSHYSYNELTVNIPRNVNVSGVEEEDPSREMEVINLPMDQPPQRVSRRLLRLADNCGGKVMRVGAGTALLRFPTPDHASRALKRMDGEDVFGRKISTRYARTFQSTYSSDEGYSTGQRPQFVAPVVQPQPPPPPPPSEPPRSASSDWALALQQLPAPTPPPLQFCPPPAPKPRKIRGTHGSASLDRSGCSSTNSDDRRENSHSHSHSRAVSPWNSSVSEHSEDDTDLTVSNLPPYEPPVLQEMLTKLFNQYVPVVRVSVWAAGEGPLATVVVRSEWDARLAIARVHKRRLDNNWAGRRLELSLGRPSPAPNLDVLRSRLRAILLDQKNFSLPLLRLRDAYASRHCCALTTSDIAKVKDTVVIHEGFGRMVQLVDHTPVVNTETEEAPWKCHIHAVLNTSHEDGSRILQPVYMDIAVLAKNTQVLLENHGGILPLLSFVECYEAMFPSLVTDNRRGVALELLLRSIQTLEVKDSPSRHLTWKTSTDSPSQSYISDTSRSSDRDRPRTAPALEPMLALFERELVDLLRTAPRCSIPFSKLIPAFHHHFGRQCRVADYGFTKLPDLLSALGNTIVVLGSGSYRVITISSAAQGRRWTSDLLKILKAQPGRVIHIHDLPQLYQSTIGRPFSTVDYGVCTMDELMEKVSPQSVIVSPEGTISLPRRTPTPEERTRTVQFAVQAVELLCYTPNLRMEFSRFVPAYHAHFGRQLRVAHYGCVKLVELFELIPEAVSVYCESSGERSVRLGLQTATAVMAQRLKSLAPVSITTFPSRYAAQFGAPPLPDCLDAPNLESLVYAAGGFIEGDIIHVGDSSQWANSALSACAVLSADRSVARGSTEEYFMTAFRSLYGIEPDVSNLMMSGVLDVSERHVCLTNTWRTVWRVAQILSDYPGNVAAVEIFIEYSKRYGPSFPNAELGMDAMNTFLSKHRSVFNTSDGRWGLSAGVTLPRPEYSLRAEDYSLHDTPPGQKGSRVFESPKTNIWSSPPASALPTPTALLNHDNRRRTRLAAQFDAA-