Monarch geneset OGS2.0

DPOGS214262
Transcript: DPOGS214262-TA (5046 bp)
Protein: DPOGS214262-PA (1681 aa)
Genomic position: DPSCF300014 + 1565294-1571917
RNAseq coverage: 245x (Rank: top 42%)
Annotation:
Heliconius: HMEL011382, E-value 0.0, identity 75.00%
Bombyx: BGIBMGA005980-TA, E-value 0.0, identity 65.36%
Drosophila: CG9932-PA, E-value 1e-105, identity 41.30%
EBI UniRef50: UniRef50_D6WJU8, E-value 0.0, identity 39.97%, Putative uncharacterized protein n=1 Tax=Tribolium castaneum RepID=D6WJU8_TRICA
NCBI RefSeq: XP_969912.2, E-value 0.0, identity 39.97%, PREDICTED: similar to GA22134-PA [Tribolium castaneum]
NCBI nr blastp: gi|189238003, E-value 0.0, identity 39.97%, PREDICTED: similar to GA22134-PA [Tribolium castaneum]
NCBI nr blastx: gi|189238003, E-value 0.0, identity 35.71%, PREDICTED: similar to GA22134-PA [Tribolium castaneum]
Group
KEGG pathway:
Orthology group: MCL17753 (Insect specific)

Nucleotide sequence:

>DPOGS214262-TA
ATGTCCTCGCCGCAGAGCGGGCAACTCGGCGAGGTCACGACTCTCGTGCGCCCCCTCGAAGGCCGCCTGTGCGGCGACGCATCATCTGAGGCCTCCTCGGATGCAGCGGAAGATTTTGAATTGCCACAGTCCAAAATAAAGCGAAACTATAACTGCACGAAGTGTACGTTTTACACTCAGAACCCACGCTATTATCTGCTACATATGCGTGACGTTCACTTTGAAAAACTGAAGATTTATGACTGTCCTCATTGTCTTTATGCTTCGCGACATCAACAGACGCTAATGAGACACTTAAAGATGGTTCATGAAGCAGCGGGCTCAACGAAGGCAGCGGAACCCTTGCCTTCCACTTCTGAAAATACAGATCCTATTGAAAGGATGGAAGAATTACTCGAAGAAGTAGAAGAATGCGATGACATTCTCATGGAAATCGAAGAATGTAATGAAGAAACCTTAGAGCGTAACTTTGGTTCTGAGTCACATCAAGGAGACTCCATATCTTCGGACCAACCTATAGATAAAAATAAATTCTTTTCTTGCAACAAATGCAATTACGTGACACATATTAGAGCAAGATATACAAAACACGTCAAGTATCATTCTATGCCAATGATAAAGTGTACCATGTGTGACTTCCGTACGCCTTACAAATGGAATTTGGATCGCCATATGAAAAATCATGGAGGGAACGGAAGCTTCTACTGTTCGATGTGTAATTTCACCGCCGATATCAGACAAAGTTTGACAGTTCACGAAATGAATCATCATACGCCGCCCGTCGGCCAGCTGAGTTCCAATCGGCGCCGGAACCGGGTGGGGGCAAGCGACGTTGAGGCAACGGATGGGAGCACTCTCATTGTAAGAGAGGAAGAAGGAAGCGGTGACTCTCGGTCCTCACACTCTGCATCGGAATCAGGTGCTCAATACATGGAGAGAGACATCATTTGCTGTAACAGTGACTCCAACGAAGCATTTGCTTCAGAAACACCGACAAAAGATAATATGCATCTAAGGAAAGAATTAGAACAGCAAGCTGCTTCATCAGACGAGGGCAAAAATCAGAAACAAAGCAGAAAGATTCCTAGACCCATCCCGCAGTTAATACCCCTTAACACTTCAACTCCTACTCAAAGCAAGACAAACTCTGGGGAACCACCAGCGAAAAAAATTAAAGAATCTGAAAAAACAGAAAATGTAGCACCGGAAATATGTCGAACTTCAGTTAACAACACAGTTTCTAAAGAAAATACAAAATCCAAAAAGAACGAATCATTTTTCGACAGATTAAAAGAAAGATTGCTTACGGAAACCGGCGAGGAAGGGACACTAGTGTGTAAAAATTGCGGATTTGAAAGTAAATGTTTATCTGAACATTCTGTCCATGAAAAAAATTGTTCCGCACAATCAAACCGAATATCAACCAATCCTTTGCATTCAAGTTTAGGATCAACGCGATGTCAAAACTGCAGACATCGATGCAAATCGAGTGCAGATCTTTACATACATATGCAATCATGCAAAAAGAAAAATGATTCTTTAGAAAACACAACAGAAACATACAACCAAGACAGTAGTGAAACTCCCTCCATTAATCTTGAAAAAGAATCCGAACCCCACCCTATGGAAAATGTAGTTTTTGTCTGGAATAATATCAATCAAAATAGTAACAAATTTAATACACCGTTAGACATAAGTATAAATGATGATTCCACGCTTCCAGAACAGGGAACAACTATAGAGCTAGATATAACAGATGAAAATGAAGCGATGAGTTTATCACCCAGCCAAGCGTATGGGAAAAATGTATTTAAATGCCCTCATTGCTTGTTTTGGGCATCTACAGCTTCTAGATTCCATGTCCACATCGTCGGGCATCTTAATAAAAAACCGTTTGAATGTTCATTATGTAAATATAAGTCTAACTGGAGATGGGATATTACTAAACATATTAAACTTAAATCCGCTAGAGACCCTGAACACGCGGATGCTAAGGTTCT
TATGACCGACGAAACTGGGAGGAGAAACTACACCAAATATAATAAATTTTTAGCAATGCCAATGTTAAATGAAAATGGGAAAACTGAATTCCATTACATTGATCAAAGCACTACCATAGACACAACATTAGATGATGATTCATATGATATAAATGAAAGTACCAACAATTCTTTTGATCTACAACCACTCAATCTACAAACACAGCCAAACGAATTTAAATTTGAAATAGACGGTAGGATTCAAGAAACTAAGAAGCCTAAAAAATCTGTGTGGAAATGTAAAAAATGTAATTACAGAGATTCGTCGAAGGAAGCTCTGTTGGAACATGTTCGAGAACATTCGAAGCCAGAAGAAGCTCTAGAGGACGGCAAACTTCACATAATCCCTAATAAAAAGCCGGAAAACACGCCCGACCCTGCTGACTTAGCTTACCGATGTGGCCATTGTAATCAACTATCTAATTGGAAACATGTCATACAGAGACACTGTCGGCTAAAGCATGATGGAGTTATAAAAGTGATAACAACAATAAAACCTAAACCTGAAGCAAACACATCGACCCCTTCCTCTGCCAATGACCCATCTAATGATACTTGTACCAAATGTCCATACAAATCAACAGATAAGAATACGCTAATAGCACACTTACAACAACATCAACCGTCCTCACAATCAATATTCAAATGTTATTTCTGTCCGTTCTTTGTAAAAGATGAACATGAATTAATACAACACCTTGTTCTACATGGTATTACTGATCCAGAAGAATATATATCAAAAGCTATGGGTTGCAGATCACCTTTACCAGAAACTAATACATCGGTCAATTACTGTGGTACCAAACGTCATAAATGTACAGAATGTCCATACGAAACGAATAGTAAATCTCAGTTCATTTATCATGAGCAGTTTCATCGACTGCCCGCTGATACCCCTTATAAATGTCAAGAGTGTAATTATAGCGTCTCAAAAAGACATTTACTACATCAACATATGAGGGTACATAGTATTTTTGCCAAGAAAAGTGAAATGGACATTGAATTAGAAGCCGTCAATTCTAATCAAGAAGACATGAAAACAGACTTCTTTATAAATTTTGATGAAATCCCCTTCGTATGGGTCTCAGCAAAGAATGACTTTCATAAAATGTATAAATGTCGCTATTGCTCGTATGTTAATTCACAAAAGAGCACAATACCTAATCATGAAAAAATTCACTGTATATTATTTGAAAACAGTGATATAACTATATATAAATGTCTTGAGTGCAAGTTTACTTGTGACACTAAAATACGGCTAGCAGAACATTCCAAGACGCATGGTGAAATATATGGTCGCATTTATTGTCAAGTAGAGCCCGACGTGCCCGACGAAGAACAGATAGCAAAACTACGGAAAGTTATAGACAAAGATAAAATAAACTCGCTAGATGAATTACGAAGCGACGAAAATGTTATTGATATTAATACTAGAGATAACAAAGTTTTATATTTTTGTCAAAAATGTCCCTCTAGATTCTTTTCTGAGAGCGAGTTAAAAGTTCATGATAAATTTCATGACCTGTCTTTTTGTAACAAATGCAAGAGTTGTGAATTTTCGGTACCTCAGGAAAGTGATATGACTGCTCATAATATAAGCCATACCGACGAATATAATACAAAAACAAAAATGCTCAAATTTATTCACAAAATCCATTCCATTTATAAAGTACCCAAATTGCAGCTTGTACATTGTCCGATAACTTCGGAAATGACATGGGTTGTAGCTAATCCGGGAAACAATTACAATATAAATGAAAACAACACTAGGAAATCTACCGAAACAAAGCATGCACCAAAACAATACCTTTGCAAAGAATGTCCTGCGAAATTTTTTAAGAGTTCAGCATTAAGTTATCATATGGGATTACACGGTGGAGATGGAGACCATAAATGCAAAAAATGTAGTTATTCTGTAAAGAACATTGGAAACCTTGCAAAACATGAATTACTTCATGAAA
ATGAACATAAAGTTTCAACAGTAGATTATGAATCTGGGGAGGATTTAGACTATAAAAATATTCCCTTGTCTGGAACTGACCTATTTCAAAGAAAAACTGAAGCCCAGAAAAGAGTGTTAACTGATAAAGATAAACTCGTTAAGCCAAACGATCATTTTCCTCCAGTACTTCAAGCTGATCCCCAATTTGGTTATTTAATGCACGGCAATCCAGAATTTATATATCCCACCTATTTAAAGAATGGACGCCAAAAAGAGAAACGTTACAAATGCCATAAATGTCCATCGGCTTTCGAAAAGCGCGAGCAATATAAAATCCACTTATCCCTTCATGGTTCAAAACAAAGGTATAAATGTGAACTTTGTGATTATTCCGTTAAATATTATGCTAATTACGTGCAACATATGCGAAAACATCAAATGAATGATGAAGCTCAAGCTGAAAGAAAAAAATGTAATGGTTTTATTGAAACTGAGAACGACAAAGCAGAGAATAAACTCGACGACAGCGGTGATAATATAAAACTAGCAATTAAAACAATGCCAAAAAGACCACCTAGAAGCGATTTTCAACAATTTTCAGTGAGCGATCAGCAGACACTTCGTTTGTTACAACGTAGGCGTTCTATGAATAATTCATCTAAAGATGCTAACACATCTGACGTTTCGCCAATAAAAGATCGTAAATTGCATGTATGTCTACTGTGTCCGTATACAAACCAACGTCAGGACGCGCTTTGGAATCATTACAGAAGACACGATGAAACAGAGCGATTATGTTCTGGTAATCAAAAATGTTCATACTGCGATTTGGTTGTGGTGCAATCTCATTTCCTCCGTGAGCACCTAAAAACACATTTTAACTATCAGAAGAACCTGACTCCAGAATGCTTCGTTGCTAACGAAAACGTTAGCTTTACAATAACCAAATTAGATGACAACGATTTGTCTAGTGATCTCAAATTAGATTCCATTAATCAAAGTTTACCATGTAGTGATAATAAAATATTTGTTAAAATTAAAACTGGTGAAGTATATGTTGAGTAA

Protein sequence:

>DPOGS214262-PA
MSSPQSGQLGEVTTLVRPLEGRLCGDASSEASSDAAEDFELPQSKIKRNYNCTKCTFYTQNPRYYLLHMRDVHFEKLKIYDCPHCLYASRHQQTLMRHLKMVHEAAGSTKAAEPLPSTSENTDPIERMEELLEEVEECDDILMEIEECNEETLERNFGSESHQGDSISSDQPIDKNKFFSCNKCNYVTHIRARYTKHVKYHSMPMIKCTMCDFRTPYKWNLDRHMKNHGGNGSFYCSMCNFTADIRQSLTVHEMNHHTPPVGQLSSNRRRNRVGASDVEATDGSTLIVREEEGSGDSRSSHSASESGAQYMERDIICCNSDSNEAFASETPTKDNMHLRKELEQQAASSDEGKNQKQSRKIPRPIPQLIPLNTSTPTQSKTNSGEPPAKKIKESEKTENVAPEICRTSVNNTVSKENTKSKKNESFFDRLKERLLTETGEEGTLVCKNCGFESKCLSEHSVHEKNCSAQSNRISTNPLHSSLGSTRCQNCRHRCKSSADLYIHMQSCKKKNDSLENTTETYNQDSSETPSINLEKESEPHPMENVVFVWNNINQNSNKFNTPLDISINDDSTLPEQGTTIELDITDENEAMSLSPSQAYGKNVFKCPHCLFWASTASRFHVHIVGHLNKKPFECSLCKYKSNWRWDITKHIKLKSARDPEHADAKVLMTDETGRRNYTKYNKFLAMPMLNENGKTEFHYIDQSTTIDTTLDDDSYDINESTNNSFDLQPLNLQTQPNEFKFEIDGRIQETKKPKKSVWKCKKCNYRDSSKEALLEHVREHSKPEEALEDGKLHIIPNKKPENTPDPADLAYRCGHCNQLSNWKHVIQRHCRLKHDGVIKVITTIKPKPEANTSTPSSANDPSNDTCTKCPYKSTDKNTLIAHLQQHQPSSQSIFKCYFCPFFVKDEHELIQHLVLHGITDPEEYISKAMGCRSPLPETNTSVNYCGTKRHKCTECPYETNSKSQFIYHEQFHRLPADTPYKCQECNYSVSKRHLLHQHMRVHSIFAKKSEMDIELEAVNSNQEDMKTDFFINFDEIPFVWVSAKNDFHKMYKCRYCSYVNSQKSTIPNHEKIHCILFENSDITIYKCLECKFTCDTKIRLAEHSKTHGEIYGRIYCQVEPDVPDEEQIAKLRKVIDKDKINSLDELRSDENVIDINTRDNKVLYFCQKCPSRFFSESELKVHDKFHDLSFCNKCKSCEFSVPQESDMTAHNISHTDEYNTKTKMLKFIHKIHSIYKVPKLQLVHCPITSEMTWVVANPGNNYNINENNTRKSTETKHAPKQYLCKECPAKFFKSSALSYHMGLHGGDGDHKCKKCSYSVKNIGNLAKHELLHENEHKVSTVDYESGEDLDYKNIPLSGTDLFQRKTEAQKRVLTDKDKLVKPNDHFPPVLQADPQFGYLMHGNPEFIYPTYLKNGRQKEKRYKCHKCPSAFEKREQYKIHLSLHGSKQRYKCELCDYSVKYYANYVQHMRKHQMNDEAQAERKKCNGFIETENDKAENKLDDSGDNIKLAIKTMPKRPPRSDFQQFSVSDQQTLRLLQRRRSMNNSSKDANTSDVSPIKDRKLHVCLLCPYTNQRQDALWNHYRRHDETERLCSGNQKCSYCDLVVVQSHFLREHLKTHFNYQKNLTPECFVANENVSFTITKLDDNDLSSDLKLDSINQSLPCSDNKIFVKIKTGEVYVE-
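
Length check:

The lengths given in the record header can be cross-checked with simple arithmetic. The sketch below is illustrative only and not part of the original record. The helper names `cds_length` and `span_length` are hypothetical, and treating the genomic coordinates as 1-based and inclusive is an assumption. It shows that 1681 codons plus one stop codon account for the 5046 bp transcript, and that the genomic interval is longer than the transcript.

```python
def cds_length(protein_aa: int, include_stop: bool = True) -> int:
    """Nucleotides needed to encode protein_aa residues, plus an optional stop codon."""
    return 3 * (protein_aa + (1 if include_stop else 0))


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive coordinates."""
    return end - start + 1


transcript_bp = 5046   # from the Transcript line of the record
protein_aa = 1681      # from the Protein line of the record

# 3 * (1681 + 1) = 5046, so the transcript is exactly the coding sequence plus stop.
print(cds_length(protein_aa) == transcript_bp)  # True

# Genomic interval from the record: 1565294-1571917 on DPSCF300014.
print(span_length(1565294, 1571917))  # 6624
```

Under these assumptions the genomic span (6624 bp) exceeds the transcript length (5046 bp) by 1578 bp, which would be consistent with the gene containing intronic sequence.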