Monarch geneset OGS2.0

DPOGS212167
TranscriptDPOGS212167-TA3621 bp
ProteinDPOGS212167-PA1206 aa
Genomic positionDPSCF300038 + 759542-772219
RNAseq coverage167x (Rank: top 51%)
Annotation
HeliconiusHMEL0125500.070.42% 
BombyxBGIBMGA006614-TA7e-16965.45% 
Drosophilacorn-PC1e-2430.63% 
EBI UniRef50UniRef50_B0W6W45e-4431.23%Cornetto n=2 Tax=Culicinae RepID=B0W6W4_CULQU
NCBI RefSeqXP_001844448.19e-4531.23%cornetto [Culex quinquefasciatus]
NCBI nr blastpgi|1700331632e-4331.23%cornetto [Culex quinquefasciatus]
NCBI nr blastxgi|1700331631e-4630.05%cornetto [Culex quinquefasciatus]
Group
KEGG pathway 
Orthology groupMCL24884 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS212167-TA
ATGGCGGAAGAAAATAAAAATGCTCTCGTCGCGTGCGATCCGCCTCCTAATTTTAATAATTTGGATGCAACTACAAATTCAAACCAGCCTTTAGAAAGTAAAGGACACAATATAATTGATATTTCCATTGCTGCCGTAAAAGAAGACAGTGATATTGTTCCTGTCAAAGTAGATGATTCTAAAAATGCAAATGTCAATGTAAAACAAGATTTAGTAAGAAACGGTTCGTGCGACGCTATAGATGACGTACGATCTTTGGGGGACCTTGACAGTTTGCCTGTCGGAGATGACTTGGTGTTAGGCGCAGCGGGAAGCGACAGCGGTGTAGAAGGATGTGGCAGAGCTCTCAGTAGCGGAGGTGGATCCAGATCTTGTGCATCTAGTGTTGTTTCATGTGGATCTGGATGCGGATCAGAGAGCTCTTCGTTAGCTGGGGCACCTCCGCGACCACGTCGTCGAGTAAACATCACCGTTCAAGAACCTAAGAGAAGTTCACCGGTCAATTTATCATCGCGACCCATAACAGCGCCTCGTGGCCCTAATTTGGCAACTCGCGAACGCGCAAGAAGTAGGGAAAAACCAACACCCCCAGAAAAACCGCGACCTCTAACACCTAAGCCCAAAATACGCCCCACAGCTGATCTTCCCAACTTAGTTCGAGAAAGTCCAGCTCTTCGTGCAAAACCAACAAAAACTTCCACAGCAAGATGTCGAACTCCAAATTCTCCCGTGGATGAAAAGAAATGGCCAACCAATGGTCACGTGCAGCGACTACCAAATGCAACTGATGCCTCAGCAACCCGCGTCGCTGCTGATAAATATGGAACATTACCACGAAGGAGAAGAGACGCTGATCCGGAGTCATCTCCAAAACATGAAAGTATACCGCCCACCTCAAGAAGGCCGACCGTCACACGATCTGTTTCGTCTCGATCAACTAAAACCCGTGTAAGGATTTACGCTGAAAAAACATGCCAGACGGTTCTCGTCGGGGCTGATATTGAATCGGCGCTCGCTGGATTCGTGCCAAACATTGAAAGACGGGACGTTACCCTTTGTCACCGCGGAGTTCAGGCTAGCTCACGGGATACTGAGACAGCTCGGCTAGTAGCAGCAGTAGCGGCGGCAGAGGCAGCAGCAGCTGAGGAGAAGGAGCGAAGACAGCATGTCGAGACGCAGTTAGCCGCTGAACGTTCTGCAAGACTCGCTGCCATTTCAGAGCTAGAGAGAAACTCCCAGAGATTGCTCGAACTCGCCGGCGCTGGCGCTGGCGCTGGCGCTGAAGGTTGTCTGCGAGCTTTGGAAGAACAGCTCCGATCAGCGCGGGAACTGTCAACAAGACAGCGGACGGAAATAGACTCGCTCAGGGAACACTGTGATAAACTACATGCGGAGTGGTGTCGCGTGCGCGAGTCGCGCCGCGTGTTGTCCGCTCGTCTGTCGGAGGCGGAGCGCGAGGCCGCTGAGATGCAAGACTTCCTCGCCGCGGAGACCGGAGCCCTGGGGGACTCGCTCAGGGACGCCGAGGCGGAGATTGAAAAATTAGCTTCTGAACTAGAACGAAGACGCGGCGAGTGTCGTCAGTTAGTCCGTATGTGCGAGCAGCGCCGCCAGGAGGCGTTAGCGGCGAGCGCGCGGGCTCGGCGCGGCGCGGGCGCGGCGGCCGCTCTGGACGCGCTCGCCCGCCGTTTGCACGCGCTCACCGAGGCCGTGCGTGCGGCGTACCAGCTGCCCGCACACGTCGTGCACCCGACAGTCTTCCACAACGAGGCGTATTGCAGCCGTAGCGACAGCGGCGAGACCCTGTCTCCGTCGGAGGAGCCGCTCGGTCTCCTGGGAGCGGTGTCCCGGGCGCTCCGCTCGGCCTGCACACCTCTCGTGCATATGAATCAGCACGAGGACGACCGCTCCAGAATACGGGACGTTACCCTTTGTCACCGCGGAGTTCAGGCTAGCTCACGGGATACTGAGACAGCTCGGCTAGTAGCAGCAGTAGCGGCGGCAGAGGCAGCAGCAGCTGAGGAGAAGGAGCGAAGACAGCATGTCGAGACGCAGTTAGCCGCTGAACGTTCTGCAAGACTCGCTGCCATTTCAGAGCTAGAGAGAAACTCCCAGAGATTGCTCGAACTCGCCGGCGCTGGCGCTGGCGCTGGCGCTGAAGGTTGTCTGCGAGCTTTGGAAGAACAGCTCCGATCAGCGCGGGAACTGTCAACAAGACAGCGGACGGAAATAGACTCGCTCAGGGAACACTGTGATAAACTACATGCGGAGTGGTGTCGCGTGCGCGAGTCGCGCCGCGTGTTGTCCGCTCGTCTGTCGGAGGCGGAGCGCGAGGCCGCTGAGATGCAAGACTTCCTCGCCGCGGAGACCGGAGCCCTGGGGGACTCGCTCAGGGACGCCGAGGCGGAGATTGAAAAATTAGCTTCTGAACTAGAACGAAGACGCGGCGAGTGTCGTCAGTTAGTCCGTATGTGCGAGCAGCGCCGCCAGGAGGCGTTAGCGGCGAGCGCGCGGGCTCGGCGCGGCGCGGGCGCGGCGGCCGCTCTGGACGCGCTCGCCCGCCGTTTGCACGCGCTCACCGAGGCCGTGCGTGCGGCGTACCAGCTGCCCGCACACGTCGTGCACCCGACAGTCTTCCACAACGAGGCGTATTGCAGCCGTAGCGACAGCGGCGAGACCCTGTCTCCGTCGGAGGAGCCGCTCGGTCTCCTGGGAGCGGTGTCCCGGGCGCTCCGCTCGGCCTGCACACCTCTCGTGCATATGAATCAGCACGAGGACGACCGCTCCAGAATGTCCGATGACAACGACAACTCCGCAGACCTGCTAGACTCAGAAACGGAGCCCTGTCTCGTCACTGATCCGGAATACGCCGAGGACTGGTGGTCGGGGGCTGAGGGGGTGGAAGGGGGGGACGGCGCCGCCGGGTCCAGCAACGATGACCTCAGCCCGGAGAGGGAATCAGCAGATGTCGACAGAGAGGTGGAGGGGTCCGCTGAGTCTTCGGTGTCGGAACGGGAGTCGCTTCGAAAATTGTCAGCGGCCATTTCGAGGCAGCGCTGGGAGGCCGAGGCCGAAGCGGACGAGGCGGCTCGTGGAGCTCTGTTGGACAGAGTGTTGTTGCTGGACTTACGCCTGGCGGACCTCTTGCGCGCTCTGGCAGCCGCCGCCTCCGCCTCGGCGGCCTCGACCACCACCCCCGCCTCGACTTCTTCACCAGATACCACCGCTGCGCTCGTTAACGCGCTCAGAGACAAAATAGAAGTTACAGAAAAGAATGTTGCCGACGTTGTCGTCAAGAAACTCGCCGAGCTTGAAGCTTGTAAGAACCTCATAGAACAGTATCAACAAAATATAGAAGCGCTCAAAAACCAGGTCGTTCAGCGCGTTGAAGAAGACGATAGCGAGCTAATAGAACAGGAATTAAACTCTTGTGATCCAAATGAGCGAAGTCGAAGATGCGAAGCCGTGGAAGCGGCGTTAAGCGCTCTACCAGCACGTCCGCGACTGTTGCCGCTCCGCACCCGACTGCAGCGTTTGGCGCGAGCGTTAGCGCCCGCGTCTCCCCCCACCCCGCCCCCCGCGCACACTTCACCCGCCGCTACGCCAACCGCCTGCACATAA

Protein sequence:

>DPOGS212167-PA
MAEENKNALVACDPPPNFNNLDATTNSNQPLESKGHNIIDISIAAVKEDSDIVPVKVDDSKNANVNVKQDLVRNGSCDAIDDVRSLGDLDSLPVGDDLVLGAAGSDSGVEGCGRALSSGGGSRSCASSVVSCGSGCGSESSSLAGAPPRPRRRVNITVQEPKRSSPVNLSSRPITAPRGPNLATRERARSREKPTPPEKPRPLTPKPKIRPTADLPNLVRESPALRAKPTKTSTARCRTPNSPVDEKKWPTNGHVQRLPNATDASATRVAADKYGTLPRRRRDADPESSPKHESIPPTSRRPTVTRSVSSRSTKTRVRIYAEKTCQTVLVGADIESALAGFVPNIERRDVTLCHRGVQASSRDTETARLVAAVAAAEAAAAEEKERRQHVETQLAAERSARLAAISELERNSQRLLELAGAGAGAGAEGCLRALEEQLRSARELSTRQRTEIDSLREHCDKLHAEWCRVRESRRVLSARLSEAEREAAEMQDFLAAETGALGDSLRDAEAEIEKLASELERRRGECRQLVRMCEQRRQEALAASARARRGAGAAAALDALARRLHALTEAVRAAYQLPAHVVHPTVFHNEAYCSRSDSGETLSPSEEPLGLLGAVSRALRSACTPLVHMNQHEDDRSRIRDVTLCHRGVQASSRDTETARLVAAVAAAEAAAAEEKERRQHVETQLAAERSARLAAISELERNSQRLLELAGAGAGAGAEGCLRALEEQLRSARELSTRQRTEIDSLREHCDKLHAEWCRVRESRRVLSARLSEAEREAAEMQDFLAAETGALGDSLRDAEAEIEKLASELERRRGECRQLVRMCEQRRQEALAASARARRGAGAAAALDALARRLHALTEAVRAAYQLPAHVVHPTVFHNEAYCSRSDSGETLSPSEEPLGLLGAVSRALRSACTPLVHMNQHEDDRSRMSDDNDNSADLLDSETEPCLVTDPEYAEDWWSGAEGVEGGDGAAGSSNDDLSPERESADVDREVEGSAESSVSERESLRKLSAAISRQRWEAEAEADEAARGALLDRVLLLDLRLADLLRALAAAASASAASTTTPASTSSPDTTAALVNALRDKIEVTEKNVADVVVKKLAELEACKNLIEQYQQNIEALKNQVVQRVEEDDSELIEQELNSCDPNERSRRCEAVEAALSALPARPRLLPLRTRLQRLARALAPASPPTPPPAHTSPAATPTACT-