Monarch geneset OGS2.0

DPOGS206950
Transcript: DPOGS206950-TA, 2049 bp
Protein: DPOGS206950-PA, 682 aa
Genomic position: DPSCF300001 - 270055-281027
RNAseq coverage: 1426x (Rank: top 9%)
Annotation
Heliconius: HMEL014363; E-value 0.0; identity 76.79%
Bombyx: BGIBMGA012958-TA; E-value 2e-140; identity 73.66%
Drosophila: CG12075-PB; E-value 1e-49; identity 56.05%
EBI UniRef50: UniRef50_D2A4Q8; E-value 1e-87; identity 45.95%; Putative uncharacterized protein GLEAN_15324 n=2 Tax=Tribolium castaneum RepID=D2A4Q8_TRICA
NCBI RefSeq: XP_972221.1; E-value 1e-88; identity 45.95%; PREDICTED: similar to conserved hypothetical protein [Tribolium castaneum]
NCBI nr blastp: gi|91084317; E-value 2e-87; identity 45.95%; PREDICTED: similar to conserved hypothetical protein [Tribolium castaneum]
NCBI nr blastx: gi|91084317; E-value 7e-99; identity 42.81%; PREDICTED: similar to conserved hypothetical protein [Tribolium castaneum]
Group
KEGG pathway:
Orthology group: MCL26025 (Lepidoptera specific)
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS206950-TA
ATGGCGTTACGTTTAAAAAAGGACATAAAGAAGGCTTCTTACTACGTGTGGTTCCTTGGAGCTCAAGAGTCTCGCGGGTTGAGGGGTCAGGAGTTCGCAGTTCCAGCGATACGGCTGTTGGAAGAAAGAGCCAGGGACTTGGAGCCATTCAAAGTAACACTGCAGGTCTCCCACAAAGGGCTTAAAATCATTCAGAATGTGACAGCGAAGGGCAAGCAGCAGACTATCAAGCACTTCATACCACACGGAAGCATCACCAGCGCTGTGGTACAAGGAGACGTGGTTGCCTGCGTCCTGCTGCTGTATAACCCGATCACCGGCTGTCCAGTTCATGTCCATGCTTACAGATGTGATTCGGATCACACTGCAGAAATGCTGTATACCCACCTTCTCGCGTTGATCGAACGCCCTGAGAACCAAAAGAAGTTTGCTGATATAGAGAAAAAACTTCAAATGCGAGGAGCGTTACCAACCAAGAAACCTCCGGACTCGTCCCTTGGCAGCGAAGTTTCCAGGGAGTCTGACTCCGGCAGTACTGAAGATAGAAACGTTGCTAATCTATACGATAGCCTCGCTGCTGAGCTCAAACAGAAGTTGTCTACAGGTAAAAAGGGAATAGGTAAAAGTCCTATCCTTCTTCCACCTCGAGATTATGACACGGTACATCGCCAGAAAGGCAACCTGAGTAATATAGACACGCGTCGCTGTCTAAACCAAAACATCGTAGGGATCAACGCACGTCGGAAGTTGGAATCCTCAGGCGGCAGTTCCGGGATTGGCAGCGATCTGGCTCCATCCCCCGAGAGGAACGAATACCCACGACATGACAATCACAGCACAAGCGAAGAGGATTGGGCTGAGAGCACTGCTTATATGATGCATGAAGCTTTTGATGCTTCTCCTCGGCGAGCTTCACCTCCTCGTCGGATTTCACCTCCTCGTAGAGCTTCACCGATGCGTCGGACTTCACCTACTCGTCGCCCATCACCAACACGTCGCCCATCACCAACACGTCGGCCATCACCAACTCGTCGGACTTCTCCACCTCGTCGTGCACTCTCTCCTCGCTACGACAGATCTTATAATGATGATTTCAATGATCATTTCCGCGACCAACTGGCTCCCTTTCGCGATCAAACTTTTGAACGTCGTTTCCGTCATGAATCTCTGGAACGCGTTAACAAGGATAAAACAGATAAATTTGAAGAGGAACCTGTAAACTATCGACGCAGGTTCATTCCAGACACTAAACCTTCCAACCGTAGCATAGACAGAGTAGAGGATCGACGATTTGGCAAACTGCAGCGGGGGGCCAGCGAAGGTAGTCTAAAAGTAAATGATGTCACCCCTAAAGAAAGATTCAATGTGGCTAAAGAAAAGTTTATGAATCTAGAAAGGGAAAGGTTTAACAGGGAACTGGAAGCACACATGGCAATGAGACGTAGCATGCTAGAAAGAAACGGTCGCTCGACTTCTTCTCCAGAGGAAGAAATTAGGGATGAGCGTCCAGAGCGTCACAGAGAATCTGGTGCTCGCAGAGAAAACGATCGCTCTCATAGAGACTTGGATCGTTCACATAGAGAACCAGAACGACGCAAAGAAGAGAGAGTGCGTGACTTGGGAACAACCGATTTAGGTCATAGAGAAATAGAAAGACTACCTCGTGTTGGACGAAAACGTGATGAAGTCAAGAGGTACGAGTATGGGACAGAAGATTATTTGAACAGAAGCGAACCAAAAGCACATAGGGATGAATACGATCGGTATAGAGAGAGGAGAGAATTAGACCGCCGTATTGACGAAGACGAGGTTATAGAGCCAGTCATTGAAGAGAGAAAGAGAGATTGGACTGACAGACAAAGAGCGTTCAGTCGTTCCCGTTTGTTGGATGAACAGCGGCAGTCCTACGCCGAAGGAGACAGGGATAGGTTCTTCGACAGGGATTACAGAGATTTCCGTGATTTAGCCGAAAGACAACCAGCGAAGTTCAGACACAGCTA
CGCTGAACCTATACCTCGAGGCAGGCTCGGTACTGTCAGACCTTATTAA

Protein sequence:

>DPOGS206950-PA
MALRLKKDIKKASYYVWFLGAQESRGLRGQEFAVPAIRLLEERARDLEPFKVTLQVSHKGLKIIQNVTAKGKQQTIKHFIPHGSITSAVVQGDVVACVLLLYNPITGCPVHVHAYRCDSDHTAEMLYTHLLALIERPENQKKFADIEKKLQMRGALPTKKPPDSSLGSEVSRESDSGSTEDRNVANLYDSLAAELKQKLSTGKKGIGKSPILLPPRDYDTVHRQKGNLSNIDTRRCLNQNIVGINARRKLESSGGSSGIGSDLAPSPERNEYPRHDNHSTSEEDWAESTAYMMHEAFDASPRRASPPRRISPPRRASPMRRTSPTRRPSPTRRPSPTRRPSPTRRTSPPRRALSPRYDRSYNDDFNDHFRDQLAPFRDQTFERRFRHESLERVNKDKTDKFEEEPVNYRRRFIPDTKPSNRSIDRVEDRRFGKLQRGASEGSLKVNDVTPKERFNVAKEKFMNLERERFNRELEAHMAMRRSMLERNGRSTSSPEEEIRDERPERHRESGARRENDRSHRDLDRSHREPERRKEERVRDLGTTDLGHREIERLPRVGRKRDEVKRYEYGTEDYLNRSEPKAHRDEYDRYRERRELDRRIDEDEVIEPVIEERKRDWTDRQRAFSRSRLLDEQRQSYAEGDRDRFFDRDYRDFRDLAERQPAKFRHSYAEPIPRGRLGTVRPY-