Monarch geneset OGS2.0

DPOGS206222
TranscriptDPOGS206222-TA4323 bp
ProteinDPOGS206222-PA1440 aa
Genomic positionDPSCF300334 - 157521-170056
RNAseq coverage42x (Rank: top 72%)
Annotation
HeliconiusHMEL0112290.062.03% 
BombyxBGIBMGA009742-TA5e-16457.52% 
DrosophilaCG42326-PD3e-4030.15% 
EBI UniRef50UniRef50_E3XBU02e-4531.00%Putative uncharacterized protein n=1 Tax=Anopheles darlingi RepID=E3XBU0_ANODA
NCBI RefSeqXP_974545.21e-4535.62%PREDICTED: similar to CG14748 CG14748-PA [Tribolium castaneum]
NCBI nr blastpgi|3123727037e-4531.00%hypothetical protein AND_19812 [Anopheles darlingi]
NCBI nr blastxgi|3320243332e-5628.12%hypothetical protein G5I_07039 [Acromyrmex echinatior]
Group
KEGG pathway 
InterPro domain[38-110] IPR0030141.6e-06PAN-1 domain
Orthology groupMCL15441 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS206222-TA
ATGTTGAAGTGTTTTCCATATTTGTGCCTGCTAGTACATGTGGCGGTCTGTTCCCGAGTTATACTCGTAGAGAATGCACCAACTTGTTTCCGACGAGTGCTCGCTGGGAAGAGAGCTCTGAGGAGTCACGTGAGGAGAGTCGTGGACTGTGAGCGGCTGGAAGACTGTCGGCGAGAGTGCGCCACTGAGAAGAGGTTCCCTTGTGAGAGTTTTAACTACAGATTGGACCCCAGCTTCCGAGGGAAAGGTCTGTGCGAGCTGATGACCAAACCGATAGAAGCCTTCGACCTCCGACAGGACTTCGTGGAAGATAAGGATTACGATTTCTACGAAGTGGACAGGAACAGTTTGGAACCTTACTGCCCGGAAACACTAAGGGGGCCGGGGCTCTTACATTCTGGTTACCTGTCATCCAAGAACAAGTTGACGTCTCAGGACCAGTGGCGGGACAGGAACTGGTCGGCTCACGATCACAGGACGTACAACAGGTTCTTTGACGAGAAGTCTCATAACAGACGCCACTGGGGGCTCAAAGAACACGGCGAGGAGTATCAGAGAGAGGACAGCTCCACCTACAACAGTCTGAGCAGAAACCACGAGCAGGAGTTCAACTACTACAGTCTGAGGCGCCACGGAGAGGAACACCTGGGTTACGGCTACGGCACCTGGAAGAAAGGCAGGTGGAACAACTCCGGCAACTTCTGGAGGGACGACACGCCCACTAAGATAGAATATGATGAAGAAGACAATGTTAAAGATTGTTCGTCCCGCCGTCGGCCGGGCATGTCCCTGGGAGTGGGCGCGGTGAGACGGTCCCTGGCCGCCAGGACGGTGGTGGACTGTGAGGCCGCCTGCTTCGGAGAGAGGAACTTCAAATGTGTTTCGTACAGCTACAGGTACTCGAGTTCGCCAGGCTCTGACAACTGTTTCCTGAGCGAGAGACCCTACAAGGGCCTGGAGATGTCCGCGGACAGCAGCTCGGACGTGTACGCCATGCCGCTGCATCACGACTGCCTCACCATCAGCACCAAGCCCTGGGTCGAGAGCGGTGGAGCTGGTTGTTCTGTAGAGTGTTTCTGGCACGTCCGGTCGGGCGCCGCCCTGAGTCGGGCCTCGGTCCGCTCGTCGCTGACCGTCAGCGGTCTGGGCGCCTGCGAGGCGGAGTGTATCAGGGCTCACGGCTTCTTCTGCAGAGGATTCAGCTTCAGGTTCGATCCTCCTACAATAGGCGACGACCTCGAGAACTGTCTGCTGACGTCATCACCTCCCACCACCCTGGACCTCTCGCGCGGTCTGACTCCCAACAAGCACGAGCTGTACTCCCGCGGGAACTACGGCCGGGGATGTGAGCCCGCGCTCTATGACGACGCTGAACATGAACCGCAGTGTTACCTCCAGTACGTGGAGTCGGCCCGCCTCAGCCGCGGTGCGGTCCGCGGCCGAGCGCGCACCTCAGACGAGAGAGCGTGCGGTCGAGCCTGCACCGACGCCCCCTTCAAGTGTCTCAGCTTCTCGTACACAAGCAACGCTCCCCCTGACAAGGACAACTGTCTGCTGTCAGAGATCCGTCTCTTCGATTTGCATCGCGGCGTCGACTACGAACACTCCACAGACGACTTGCTGTTCGCCTTCGACCTGTTCAACGGACAGTGCTGGAGGAAGATCCACGGGAAAAATGAATATGAAGTTCCGACACTTGAAGTGCCGCATCCCATACAGACAGAAGAGAGCTATCCCTTGACATCCGGTCCAGACGCGCCACCTTCAGAAACTTATATAACTAGCTCGGGTCCGAGTGGCCCACCAGCTCACAAACCGTACATAATAGAAGCCGACTTTAAACCCGGCTCTAAACCATACCTTGAAAGTGGAGAGAGCGATAGACCCTTTGAATACCCGGAACCAGGCTATAAACCGTATCATAAACCGTATAGACCAGATTATGAACCAGAGAAACCTGACGTAGGTCATAGACCTAGACCGAGTGGACCAGAACTGACACCACCTTATGGTTCAAATCCATCATATGATGGTGGTTCATCTATCACTCACTCCAGTGGGACCTTAATAAGTCAGTCAAGTGGTGCGTCATACGGCTCTTCAAGCAACTTCGCTGGCGGTTCATCGTATGCTGGCCATGAATCACATGTCGGTTCAGATCACAGTTCCTCCGCATACGGCGGTTCCGCCAGTAATTCTAACTATGGTTCTGCAAGTAACTCTGGCTATGGGTCAATAAGTAACTCAAACTATGGTTCTTCGAGTGGTACAGGTTATGGCTCTTCAAGTAACTCCAATTATGGCTCTTCAAGTGGCTCAGGATACGGTTCCGCCAGTAACTCCAACTATGGTTCTGCAAGTGGATCAGGCTATGGCTCCTCAAGTAACTCAAACTACGGTGCCTCAACTGGTACAGGTTACGGCTCATCAAGTTACTCAAATTATGGTTCCTCAAGTGGTTCAGGCTATGATGCATTAAGTAACTCCCACTATGGTTCTGTGAGCGGCTCAGGATATGGGTCATCCAGTAATTTAAACCACGGTTCCTCGAGTGGGGCAGGTTACGGTTCAGTGAGTGGCTCCAGCCATCACTCATCCAGCGGTTCAAGCTATGGAGCCTCAAGTGGTTCAGCGTATGGTTCATATGCTGGTTCAGGCTACGGGTCCTTAGGAGGTATCGCAGGAACTCGTCCTCGTCCTAATCCTGTCCAAAGTGATCATCGTCCTAACAGACCAGGCAGACCGGGAGACAGAGGCGATGACAACCTGTCTTTGTCGTGGCGGCATTATACCGTGTCCGGGTTCCCGTGTCGGCGCGGCACCGCGTGTGAGAGGAATGTGATAGCGGGCCACTGGGCGTGCGAGCCGGAGGGGGGAGAGATCGGCTCTTGGGATTACTGCTGCGCTCCCACACATAGGTGCGGATACAGCGAAGGGTTCCGGAAACCTTGGTGTTATGTGGGTCCGGCGTCCGACCAATGGCGTCCGTGTAGTGAGAAGTACTATCCGTACCACCAACACAATTTTCCTCACCCCTCGCAGGGACACAGAGAGTCAGACCGCCCGCAAATAAATATACCTCAAGGCCAAAAGACGTACCCAGAGAGGGACAGACTTTCATCTGGCTATCTGTCCTCCGCTGATAGACGGTACTGGGATGACCTGTACGAGAATGGTCCTCGAGCTTATTACGATAAATATGGGAATCCTTTGCCAGGCTTCTCGAAAGTACCAACGGAGAGCCGACCGCACATAAAATACGAACGTAACCCACCACGACCCAGTTCGGGTCAGTGGGTGCCCGTCAATACTTTACCTGATGACGAGGTGCCTCCACCAGGTCTTGGCGTGCCAAGATATTGGCCAGTGGCATACTTACACAAAGGACCTCCGCCCAACATGACGTACTTTAAATACAACGAAACCGAGAGAACGACTCGCTCGCCCCAGGAACACACGACCACTAACAGAGCGAGCATCAACCAAATAGAAGCTAGGTCTGGAGACAGTCCGAGGACTGAGAAGAGAATTAATATCACGAACGACGACGCCGACTACCTCGACGTTACCACGACCAGAACCGCTGGGAACGAGACGCGGGAGAGCACGACGGAAAGAGAAGTTAAAAACGAGACAGACGCTGAAGAGTACAGAACAAACCCGAAACTAAACGGCGATTACATCAAAGGAATCGACGGGAAGTTGCACGACTTCACGACGTCGTTGGAGGTGTTCGATATAGACGACGTGAAGAACGATAAGCTGTCCCACCTGAGAGCCGCGGAGGCTGAGGAGAAGCAGATAGAAGCCATCGGCAGGCTCCTGGCCGCGAGGCGAGCGAAGATAGTCGTGGACAAGACGTCGCAGAGGAATCTGGAAGATAAGAACATTGCGCTCGACAAAGATTTCATGGATTTCAATTTCGGCAACAAATTCCCGGTCGAAAGACGAGGAGTCGTCCAGAGAGTTTCCAAAGACGAGATCGAGAACAGAGACAAGAGTTTGGAAGTCAGCGAGACGACCTTCGTTAGACCTCCGAGGGTTCTGAGCACCACGGAAAATATAAGGAAAGCAGTCGTCAACGGAAAGGTCTACTACGAGGCCTCGCTCCGCAGTCAGAGGGACCTGTACACCAACTCCACGAGGAGGCCTAAGAACCTGCGAGAGACGAGGACCCTCCCCACCAACAACAAGAAGAGAACCAGGAACACGAACCCCGTGCGACGAGCGAAGAGAGTGTACAGGAAGAGATACAACCCGGAGGAAGTGAGGAAGAGATTGTTAGAGAGAGAGAGGAACAAGAACATGAGGGACTCGCGCTAA

Protein sequence:

>DPOGS206222-PA
MLKCFPYLCLLVHVAVCSRVILVENAPTCFRRVLAGKRALRSHVRRVVDCERLEDCRRECATEKRFPCESFNYRLDPSFRGKGLCELMTKPIEAFDLRQDFVEDKDYDFYEVDRNSLEPYCPETLRGPGLLHSGYLSSKNKLTSQDQWRDRNWSAHDHRTYNRFFDEKSHNRRHWGLKEHGEEYQREDSSTYNSLSRNHEQEFNYYSLRRHGEEHLGYGYGTWKKGRWNNSGNFWRDDTPTKIEYDEEDNVKDCSSRRRPGMSLGVGAVRRSLAARTVVDCEAACFGERNFKCVSYSYRYSSSPGSDNCFLSERPYKGLEMSADSSSDVYAMPLHHDCLTISTKPWVESGGAGCSVECFWHVRSGAALSRASVRSSLTVSGLGACEAECIRAHGFFCRGFSFRFDPPTIGDDLENCLLTSSPPTTLDLSRGLTPNKHELYSRGNYGRGCEPALYDDAEHEPQCYLQYVESARLSRGAVRGRARTSDERACGRACTDAPFKCLSFSYTSNAPPDKDNCLLSEIRLFDLHRGVDYEHSTDDLLFAFDLFNGQCWRKIHGKNEYEVPTLEVPHPIQTEESYPLTSGPDAPPSETYITSSGPSGPPAHKPYIIEADFKPGSKPYLESGESDRPFEYPEPGYKPYHKPYRPDYEPEKPDVGHRPRPSGPELTPPYGSNPSYDGGSSITHSSGTLISQSSGASYGSSSNFAGGSSYAGHESHVGSDHSSSAYGGSASNSNYGSASNSGYGSISNSNYGSSSGTGYGSSSNSNYGSSSGSGYGSASNSNYGSASGSGYGSSSNSNYGASTGTGYGSSSYSNYGSSSGSGYDALSNSHYGSVSGSGYGSSSNLNHGSSSGAGYGSVSGSSHHSSSGSSYGASSGSAYGSYAGSGYGSLGGIAGTRPRPNPVQSDHRPNRPGRPGDRGDDNLSLSWRHYTVSGFPCRRGTACERNVIAGHWACEPEGGEIGSWDYCCAPTHRCGYSEGFRKPWCYVGPASDQWRPCSEKYYPYHQHNFPHPSQGHRESDRPQINIPQGQKTYPERDRLSSGYLSSADRRYWDDLYENGPRAYYDKYGNPLPGFSKVPTESRPHIKYERNPPRPSSGQWVPVNTLPDDEVPPPGLGVPRYWPVAYLHKGPPPNMTYFKYNETERTTRSPQEHTTTNRASINQIEARSGDSPRTEKRINITNDDADYLDVTTTRTAGNETRESTTEREVKNETDAEEYRTNPKLNGDYIKGIDGKLHDFTTSLEVFDIDDVKNDKLSHLRAAEAEEKQIEAIGRLLAARRAKIVVDKTSQRNLEDKNIALDKDFMDFNFGNKFPVERRGVVQRVSKDEIENRDKSLEVSETTFVRPPRVLSTTENIRKAVVNGKVYYEASLRSQRDLYTNSTRRPKNLRETRTLPTNNKKRTRNTNPVRRAKRVYRKRYNPEEVRKRLLERERNKNMRDSR-