Monarch geneset OGS2.0

DPOGS201757
TranscriptDPOGS201757-TA2019 bp
ProteinDPOGS201757-PA672 aa
Genomic positionDPSCF300279 + 32199-42070
RNAseq coverage317x (Rank: top 36%)
Annotation
HeliconiusHMEL0067120.075.22% 
BombyxBGIBMGA002652-TA0.072.04% 
DrosophilaCG18304-PA3e-5443.19% 
EBI UniRef50UniRef50_D6W6X41e-13649.03%Putative uncharacterized protein n=2 Tax=Tribolium castaneum RepID=D6W6X4_TRICA
NCBI RefSeqXP_968604.23e-13348.38%PREDICTED: similar to CG18304 CG18304-PA [Tribolium castaneum]
NCBI nr blastpgi|2700149975e-13649.03%hypothetical protein TcasGA2_TC013627 [Tribolium castaneum]
NCBI nr blastxgi|2700149972e-14048.93%hypothetical protein TcasGA2_TC013627 [Tribolium castaneum]
Group
KEGG pathway 
Orthology groupMCL15606 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS201757-TA
ATGCACCATATGTATCCAATGGCTAAAGGAGATCCGCTCTGTCCCTTGGGCTTCCACCCGCAAGTGCGATGGCCGACTCGCTGCAAGCGCTGCTTCCGCGACTACAAGGAACACGGTGGTCGTAAGAAGGAAGATGATTTTGCTGCATCAACACCGAGCCTCTCTTCTTGGAACTCGCCCTCATCACGAAGTCGTGACGAAAACAATGGTGCTGAGGTAGAGAAGACAGGTCGCGGTTGGGCTTCGAGCACAAACCTCAGTATCACTGAAGTATCCAAAAAAGACGACATAACAATAGGGCTCAATAAGAAGAGCACATCGTGGACGTCAACACCAGACTTGGGTTCAAATGAAGAGGATAATACAACTGCTGTAACATTTAGTCTTAAATTACCCAAACGAAGATCTACAGGTCCTTTACCATCTCTAGATACAGGACAGAATAATACAGAAACGGTAACAGTACGTCGACCATCTCCCTCGCCACCACCAACCGTCAAACAACCAGACAGCCAGCCTTCTATAGCTCAGATAACTATCAATAAAAACGATTCGCTTGCAGAACGAGTTCGAAAGATGCAATTAATGAAAGCTCAAAGCAGCTTTGACAAAGAGTCTAGCGTTGAGAAGGAAAGAGAAAGGAGAAGTGCCTCTCGAAGTAAGGAAGAAGAAAAGTCCGTGAAACCAAAAGAAGAAAAGAGTTACAGTAAGGAAGACGTTAGTATATCGCTGGGAAAAACTCGTAATCAATCCATATCAAAGCCTCCTATTAGGGGAGGACCTATATCTCGAGATAAGGAAGAATCGGAGGATGAAGGCAGCACTATCACAACAGACACAGATGTAACACTTGTCGATCCGAACATAAAAGAGTATCAGGAACAAATTGAAAGCCTAAAGTCAGAAGTAGATTTTCTGAAAAAGCGTTGTGAACGTGTTGAGAAAGAAAAGAGTGACATTCTATTAAGAAGACTAGCTAACATAGATAACACTAATAAATATTCCACTGGACGATCTTCAGAAAATCTTAAGTTACAAAAGAAGGTTAATGAACTTACAACACAAAACGAAGACCTAAAGGATGAAAAGAAACATTTATCTCTACGGATTAAAGAAATGGAGGTAGAATTAGAATCTCGTCCATCAATTGAGGCACAAACAAAACAAATTGAACAACTCAGAGCAAAGTTATTGGCTGCAGAAACTCTGTGTGAGGAATTGATGGATGAAAATGAAGACATGAAAAAAGAATTACGAGATCTTGAGGAAGAAATAGAAGAAATGCAGGACAACTTCAGAGAGGACCAAGCAGATGAATATTCTTCGTTAAGAAGAGAATTAGAACAAACAATAAAAAACTGTAGAGTGCTATCATTTAAATTGAAAAAAACGGAGCGAAGGGCCGAGCAACTTGAACAAGAAAAAGCGGATCAGGAGAAAAAGCTTTTAGAGATTGTAGGAGGTGCTGACGGTCTACAGAGAGAGAATCGAATCAAGGAATTGGAACAAGAAGTAGCGCGGTCTACTGAAGTTGCTCTGCGACTGCAGCGTGAATTGGCAGAAGCGAAGACTAAATCATCTTCTGTTTCTGGTGTTCCGCCTGCCAATGTTAAGAAGCAAACAACTGACGGAAAGATTTCCCGATCGTCACTAACACGAGGTGGAAGCCAAGAAGATGCAGCGCAATTGCTTCGGGATCTTCAAGACTCTTTGGAACGCGAAGCTGATTTAAGAGAACAACTACGCAACGCTGAAGAAGAGGCCATTCGATACCCTGGAAGCTTCAGTGATAAACGGAATCTTCCACCACCGCATTCACCGCCAACACAGCTTCATCCTCCGCCTAACCTTGCTGTCACTGATAATAATAACAAATCGATGCTAGAAATTCAAAGAATGCAACATGTTCTCGAAAAAAAATTATCCTCATTGGATATCAACAAATTATCTATTGGCATTAGTGAATCCCATCAAACTCCCCAAGATTACATTGATTCTTATTCTGAAGGATGTCATTAA

Protein sequence:

>DPOGS201757-PA
MHHMYPMAKGDPLCPLGFHPQVRWPTRCKRCFRDYKEHGGRKKEDDFAASTPSLSSWNSPSSRSRDENNGAEVEKTGRGWASSTNLSITEVSKKDDITIGLNKKSTSWTSTPDLGSNEEDNTTAVTFSLKLPKRRSTGPLPSLDTGQNNTETVTVRRPSPSPPPTVKQPDSQPSIAQITINKNDSLAERVRKMQLMKAQSSFDKESSVEKERERRSASRSKEEEKSVKPKEEKSYSKEDVSISLGKTRNQSISKPPIRGGPISRDKEESEDEGSTITTDTDVTLVDPNIKEYQEQIESLKSEVDFLKKRCERVEKEKSDILLRRLANIDNTNKYSTGRSSENLKLQKKVNELTTQNEDLKDEKKHLSLRIKEMEVELESRPSIEAQTKQIEQLRAKLLAAETLCEELMDENEDMKKELRDLEEEIEEMQDNFREDQADEYSSLRRELEQTIKNCRVLSFKLKKTERRAEQLEQEKADQEKKLLEIVGGADGLQRENRIKELEQEVARSTEVALRLQRELAEAKTKSSSVSGVPPANVKKQTTDGKISRSSLTRGGSQEDAAQLLRDLQDSLEREADLREQLRNAEEEAIRYPGSFSDKRNLPPPHSPPTQLHPPPNLAVTDNNNKSMLEIQRMQHVLEKKLSSLDINKLSIGISESHQTPQDYIDSYSEGCH-