Monarch geneset OGS2.0

DPOGS203312
Transcript: DPOGS203312-TA, 2247 bp
Protein: DPOGS203312-PA, 748 aa
Genomic position: DPSCF300003 - 1004931-1008571
RNAseq coverage: 47x (Rank: top 71%)

Annotation
Heliconius: HMEL016640, 2e-61, 45.77%
Bombyx: BGIBMGA012299-TA, 5e-46, 35.57%
Drosophila: CG42674-PC, 6e-09, 50.91%
EBI UniRef50: UniRef50_UPI00020639D8, 2e-08, 35.00%, UPI00020639D8 related cluster n=4 Tax=unknown RepID=UPI00020639D8
NCBI RefSeq: XP_966998.2, 2e-09, 43.43%, PREDICTED: similar to GA20259-PA [Tribolium castaneum]
NCBI nr blastp: gi|340717728, 4e-08, 36.50%, PREDICTED: hypothetical protein LOC100648400 [Bombus terrestris]
NCBI nr blastx: gi|383857777, 2e-19, 24.01%, PREDICTED: uncharacterized protein LOC100882671 [Megachile rotundata]

Group
KEGG pathway:
Orthology group: MCL34493 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS203312-TA
ATGATAGATAAGATAGTCCAGATCAGCAAATCTGTGAGCAAGAGTAGTGACGCTACGAGTCTCGTATTCCTGGTTTTGGATGACCACGGAACCTCGTGTTCGAGCTTCATGCTCTCCGTGATGCCTAAGGATCCAAATCCGCAGCAGACTTTAAAAACCATGGAGAATAAAGTGAAGGAGGCCAAGTATACTTACGAAATTGCTCTCTGGATGTCCCGGAATCCCACTCGAGATATCAGCGAAATGGATTTAGATCTCAAGGCGGAATACGCGCATTATAAAAAACCGAGCAACGCCTCTGAAGAAAGTCACATAGAGCAAGAGGCGCGTGAAAGCGTCCTCTGCATGTTACACAGGGGCATGGGGGCGAATCAAGATTCAAATGCCATGACATCTTCGTATTACAATTATGATACTATAGTTATGGGTGTCAGCGCCCGGCCACCCGGCCGAAACTACGTGCCACATCTGGTTCATCGAACAATAGCTAGCTCTTTACCATACCAACAAAGGCTACAGATACCGCAACACACCAGCACTATGTCACTGCAAGAGCCGGGCACGAGCGGCTTGCAAGTTCAACGTTCAAATACTTCTATGGAGTACCTCCAGCCACCGCCGCTCTCCTCCGGCGGATCAGCAGTCACTGTAAGAGTGATAAGCGAAAGCGAATCAGAGACGATGACACCACAGCGTATTTCGCTGCAGCCTTCTCCTACACCATCGAGAAGTTCGATGCAAAGTTCTAGTACGCATACATTGCGAGTCCAAGAACCAGAAACGAATGTGACCACTCTTATACATAGCCTTCCCGATTTAACGATAGAACCATCCACTCCTCGGACTACATCAAGTCCCACACACCCTATACACATATCAGCATCAGTCAAACTGTACAGATCCCACCAACAGTTATTAAGCCAACGCCAAAAAACACAAGGACAATATTTGACACCTGACCAAAGAGGAGCCAGCTATCCTCCACCAAGTCCCACGAGAGGTTCACTTAAGCGTGGTCTTGCGTTCTCGTATTCGTTCAAACATCCGCCCCTCTATAAAATGGGTCACGTAGCGAGTCAATCGACAACTCAATTCGAACCTGTCGCATCAACAAGTCAACAGGTTGAGAGAACAGAAAGTCCAAGACCTTGTTCGAGCTTAGAAAAAACCGAGAAGAAATCAAAGCTGACCAGCAGCAGGCGGAAGCGTCGGGACATAAGCCAAGACAGATACTCCGAGCCTTGCACCAGTCATCGAGATGAGAGCCCCGACCCCTTCCGCCGAGAGGAAAGCCCCAGCCCAAGCTACAGAGATGAAAGCCCTGGAACTAGTTATAGGGATGAAAGCACTGATCTCAGTGGTATTGACAGAAGTTCTAGCCTTAGCCATAAAGATGAAAGTCCTGGTACTAGCTACAGAGGAGAGAGCCCTGCATATAGAGATTATCGGCGAAGCTCCAGTGGAAGAGATAAGAGCTCTGAGCGAAGCCATGAAGAAAGTCCAGATCCAAGAAGTGTAGAATTAAGTCCCTCTTCTAGTTACAGAGAAAAAAGTCCCGAACGATATAGAGATAGTAGCGGTTCTGATCGGCATAAAGACATGAGTTCGAGCCCTAGTCGAGGAAGCGTTAGCCCTGACCCTGCGCAAAGAAAAAAGAGCTTCAGTAGCAAAGCTGTAAGCCTTAGCCCAATCGCAAGTACTGATGAAATTCCTGGACTCAGCATTGGTGACAGCTCTCGAACTAGTCTCAAAGAAGATGACTCGCAAAAGAGTCCGGATTATCGTCAAAAAGATAAAAGTCCCAGCGCACGCATTACAGATGAAAGCTGGAGCGGTAGTACAGAAAGATTAGAAATAAGAAAACCAGCACTTCGACCTACTCGAGAACAGACACCGGGTCCAAGTCGAAGAATCAGAAGTCTTAGCCCCGTGGTTATATGCGATATTCCAATATTTTATCAAGGAGAATCGAGCTCCAATATTATCCAACCTACACCTCT
AAAAAGGGGGTTAGAAGAACCTAGTGCCAGTACTAGTAAGCCTAGTCCAAGGGTTAAAAGAAAAAGTTTCTTTCGACGTAGCAAAAAGAAGGAGAAGGGAACTTCCAGATCCATCCTAGAAAGTTGTATCTCGGGACAGAAATCTGATTCTGAATCTGAATACAGCAGCAAAGAACCAAGTCCATCCACAAGTGCAAAAAGCAAAACATCTGAAAATGGGAAGCGCAGGTGGTTTAGAAGACATTAA

Protein sequence:

>DPOGS203312-PA
MIDKIVQISKSVSKSSDATSLVFLVLDDHGTSCSSFMLSVMPKDPNPQQTLKTMENKVKEAKYTYEIALWMSRNPTRDISEMDLDLKAEYAHYKKPSNASEESHIEQEARESVLCMLHRGMGANQDSNAMTSSYYNYDTIVMGVSARPPGRNYVPHLVHRTIASSLPYQQRLQIPQHTSTMSLQEPGTSGLQVQRSNTSMEYLQPPPLSSGGSAVTVRVISESESETMTPQRISLQPSPTPSRSSMQSSSTHTLRVQEPETNVTTLIHSLPDLTIEPSTPRTTSSPTHPIHISASVKLYRSHQQLLSQRQKTQGQYLTPDQRGASYPPPSPTRGSLKRGLAFSYSFKHPPLYKMGHVASQSTTQFEPVASTSQQVERTESPRPCSSLEKTEKKSKLTSSRRKRRDISQDRYSEPCTSHRDESPDPFRREESPSPSYRDESPGTSYRDESTDLSGIDRSSSLSHKDESPGTSYRGESPAYRDYRRSSSGRDKSSERSHEESPDPRSVELSPSSSYREKSPERYRDSSGSDRHKDMSSSPSRGSVSPDPAQRKKSFSSKAVSLSPIASTDEIPGLSIGDSSRTSLKEDDSQKSPDYRQKDKSPSARITDESWSGSTERLEIRKPALRPTREQTPGPSRRIRSLSPVVICDIPIFYQGESSSNIIQPTPLKRGLEEPSASTSKPSPRVKRKSFFRRSKKKEKGTSRSILESCISGQKSDSESEYSSKEPSPSTSAKSKTSENGKRRWFRRH-