Monarch geneset OGS2.0

DPOGS200275
TranscriptDPOGS200275-TA4734 bp
ProteinDPOGS200275-PA1577 aa
Genomic positionDPSCF300026 - 859895-876636
RNAseq coverage189x (Rank: top 48%)
Annotation
HeliconiusHMEL0000120.074.81% 
BombyxBGIBMGA005549-TA7e-17365.49% 
DrosophilaCG2519-PA0.048.22% 
EBI UniRef50UniRef50_C9S2690.076.91%Similar to CG2519 n=3 Tax=Nymphalidae RepID=C9S269_9NEOP
NCBI RefSeqXP_001812287.10.047.07%PREDICTED: similar to AGAP004423-PA [Tribolium castaneum]
NCBI nr blastpgi|2613359350.076.91%similar to CG2519 [Heliconius melpomene]
NCBI nr blastxgi|2613359350.076.91%similar to CG2519 [Heliconius melpomene]
Group
KEGG pathway 
Orthology groupMCL15801 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS200275-TA
ATGTCTGGGGAGGGCGAACGCGCTTCTCGCCCAAATTCCTTGTTACTGGAAACGCCTGGACCGGAACGCGAACCGCTTCGATTCGATGTACAGCTCGTGGGAGCGCCTCCAGAAGTTGAACAGCTAGTTAATAACATCAAACAAGTTTCAGAAGACTTTCTTTATCACTGGAAAACTTTTCCAATTGTTTTACCACCATCAAGATTCACTGGCCCAGACAATCGACCATCCGATATTATAATTGCACCTCCCTGTGATGAGTTGGATGCCGCAGCACTTGATGCTGGCATTGAACCACATCCCTTGACACCAAAACAACTACATGCAATCAGAGAAAAAGGCCTGCACAAGGACAAAATATATATGCCGAGGAAGTTAAATGCTAAACAATTGGAATCCATTTGTGAATGGGGTGAGTTCGAAGTACCATCGCGTCACTTCCCCGGCCAGACCCACGTGTGGAGAGTGGCGGGATGGCTCCAAAGAGGCCGTGCTCGTGCTAAGGAGGAACTCTACTGTCATGTCGCAAGAGCGGTGGCACTTATCGTGGTCGTATCAAGAGACAGACTGCTGAGAGACGAGCCGCTATCTTTGATAGCCGGTGGTAAAGCCTTCATGGAAGGATTACAACGATTAATCGATCTAATATTTGGTCTGCCGTCTTTGGAGGCTCGCGATCTTGAAAAGAAAATACGCGAGGAAAGATCACGTTATTTGGTAGCTGAGTTAATATGTAAGAACGAAAACCAAGAGGACATAGCAGCTTTAGCGGCGTGGGTGTCGCGGGCCATGAGAAGGGCGGCTTCTGAGAAGACGGACGGGCGGGAACCGCGTGCCATACCACGTGTACCTTACATATATACGACACCGCAGAGGACACAGGTCGATCTCCGTCTATACAAACGTGATCTGCTCCGTCGCGCCGCGTCCTCCCTACAGACCATACTAGAAAGGGAGTCGAGAGGATGGTTCTTACACTTCCGTGAACGTAAAGCAGCTCTGCTCGCCAGTCAGAAGATGCCCATAAAGGATATTGAGGAGGAGGTAAGCACGGCGGCCATGAAGGAAGCAGAAGAAGGAGTACGACGCAAGCTGGAGGCCGGCCTGTCTAGTGCCGAGGCTCGTATCATAGCGGCACATCCCATACTGTCACGTGCCGACACTTGGAAGAAGGAGAAGCTAGCGGCCGCCGCCCACGGACTAAGGAACGAACTGCGCTGGACGGCGATGGAAGACGCGGCAAATGCTATGCAAGTCCACAAACTGCATCAGCACAGATATTTCCTGCTGAGGGACTTGGCTTTCCTCAAGGACAGGGAACCATTGCTTATGAAGGAGCTTCGTGCAGCTAAAACTCCTACTCGGGAGTTCACCTGGGCTACCCGTATCTGGTTTCCGGATAACTGGACCATCATACGTCACTTCCGGGGTCGATCAGAGCGTATACCAACCGTCATCAGTGGCAGAGCTACCAACATAGTGACCCCCCGGTCGGACCCGTCCCAGCCTGTATTCCTCGCTGATCGCGAGAAAATACGGACTACTACAACAAGGTGGCCATTTTGGAGACTGCTCAATCTTGCTCACAGGAGCTGGTGTTGGACGTGGAACATGATGTTTGTGCTGGGAGTGCTAATCCCTTGGTGTTCTCCGCTGTCTCTACGGACGCTGCTCTGTGTCAAACCCTTTGTACCGGATCTAGAGCTGTCTCAGGTAAACGGAACGCTGTTCCCGAAACGTAGCAGTGAAACACAGACCATGTGGTCGCAACTGCTCAAACTGTGGCGACACGTGTCCAAAGAGAGGACACGCTTTGAGACGGAACCTGATACGGGACTTCTGGGTAAAGGTATGAGCAGACAGGCGAATCGCATATGGAACTACGGCGTGATAGGCGGCTGTGGATCTCTGGCGCTGCTACTGCTGTTCCCGTTGGTGTCACTGGCGGCCAGCTTACTGTCACTGGCTGTAGCGGCCAGTGTGCCAGTGTGGATGCCACCGCTGGCTGTGGCATTACACGCTGTCAATGCGCTGGTGTTCGATCTAGATTGTCCTGATCCACCGCGGCTTAATCGGGCCGCCCAGGCGTTGGCCGCGGTGTGTGCTCGAGGCGAGCTTGAATCCCTCGCCACGTGGGCGACTGAAGTAGAGGCCGCCATAGAACGGCCGCTCAGAGACTACGCTCACTTCGTAGACGCCTGCTTCGGACCATTCTCAGTACAGATAGCTAAAACGGGTGCTTACAAACAACTTGAAAAGGAATGCAGTGAACTAGTTTGGTCTTTGAGAGAGAAAGTGGCGACGAGGAAACGCGAACTGTCCCTGGGTCTGAGTGACGCAGCCAGGGCCAGGGTACGGATGCAGGCGCAGGATTTGAGGGTAGATAAATCGCTAAAATGTAACGTACGGGATGTAGGTTGTGATATACACGAGGTGAAGAAACGGAAGACAGATCTTAAGATAAGTCATACCGGTTTCTGCTTAGCAGAAGAAGGAGTACGTCGCAAGCTGGAGGCCGGCCTGTCTAGTGCCGAGGCGCGTATCATAGCGGCACATCCCATACTGTCACGTGCCGACACTTGGAAGAAGGAGAAGCTAGCGGCCGCCGCCCACGGACTAAGGAACGAACTGCGCTGGACGGCGATGGAAGACGCGGCAAATGCTATGCAAGTCCACAAACTGCACCAGCACAGATATTTCCTGCTGAGGGACTTGGCTTTCCTCAAGGACAGGGAACCATTGCTTATGAAGGAGCTTCGTGCAGCTAAAACTCCTACTCGGGAGTTCACCTGGGCTACCCGTATCTGGTTTCCGGATAACTGGACCATCATACGTCACTTCCGGGGTCGATCAGAGCGTATACCAACCGTCATCAGTGGCAGAGCTACCAACATAGTGACCCCCCGGTCGGACCCGTCCCAGCCTGTATTCCTCGCTGATCGCGAGAAAATACGGACTACTACAACAAGGTGGCCATTTTGGAGACTGCTCAATCTTGCTCACAGGAGCTGGTGTTGGACGTGGAACATGATGTTTGTGCTGGGAGTGCTAATCCCTTGGTGTTCTCCGCTGTCTCTACGGACGCTGCTCTGTGTCAAACCCTTTGTACCGGATCTAGAGCTGTCTCAGATTTTGGAATTGAGCTCGGAGCACAGACTTCTGGGTAAAGGTATGAGCAGACAGGCGAATCGCATATGGAACTACGGCGTGATAGGCGGCTGTGGATCTCTGGCGCTGCTACTGCTGTTCCCGTTGGTGTCACTGGCGGCCAGCTTACTGTCACTGGCTGTAGCGGCCAGTGTGCCAGTGTGGATGCCACCGCTGGCTGTGGCATTACACGCTGTCAATGCGCTGGTGTTCGATCTAGATTGTCCTGATCCACCGCGGCTTAATCGTTGGTTCGTGCTGTTCGAGGTGTTGATATGGCGTATTGGGTTGCTGGGCGTGCTGCAGCCGATTGTGGCGTTTATTGTGGCTGTCTTCGTGTGTCCGCTGACAGCTCTCATTATGCTAGTTTTGTGTGTCAGCTGGTGGTGTTTGCGCGGCGTGTGGGAGGCGGTGTCGTGGCGCGTGTTGTTGGTGCGAGGCGCCCGGGTCCCGGCTCACGACTCGCGGTTCTGTCGCAGAGTCGCCGGACCAGCTCTGCTCACACACGCCTCCTATCAGATCACGGCCGCCCAGGCGTTGGCCGCGGTGTGTGCTCGAGGCGAGCTTGAATCCCTCGCCACGTGGGCGACTGAAGTAGAGGCCGCCATAGAACGGCCGCTCAGAGACTACGCTCACTTCGTAGACGCCTGCTTCGGACCATTCTCAGTACAGATAGCTAAAACGGGTGCTTACAAACAACTTGAAAAGGAATGCAGTGAACTAGTTTGGTCTTTGAGAGAGAAAGTGGCGACGAGGAAACGCGAACTGTCCCTGGGTCTGAGTGACGCAGCCAGGGCCAGGGTACGGATGCAGGCGCAGGATTTGAGGAAAGCTGTCCATCTATCAGCGGTGGAACTGTCCCGGGTGTGGGGGGAGGCGGCCTCGAGGAGCGACGACTGGTGGACGGCGCGGGGACTCGAACCCGCGGACTGGCACGCACTCGCCGCCAACATGCTCGTTGAGGTATTCGATGCGGAGATTCTAGTCGCGTTGGAGGAGGGCGAAGCCCGAGTGTCACTGGAAGCAGGGCCGGGGGCTGCGAGGTGGGGCCGAACCGCACGCGACCACGCACCGCCTGACGTGCTGGCTGAGAGAGACGACTGGGGACCGGCGCCCGGTGTGTGGGGTGAGTGGTCGGGTACTCCGGCCCCCCGGGTGCCTCCGCCCTCGCTGGAGGTGTCCGCGTTCAGTCCCCGGACCCCCGCGCTGCCCCCCCTGCCGCCGCCCGCTGCCGTCGCCTTAGTACTGCATAACAGGGAATCGGACAATCCTATACAATTGGATTCAGAGCTCTGTACTGAAATACTGAAGACTCTAGAAGATGCGCCCGACTGTGACGACAGACGAGATGACGTGGAACGTTATCGCGGCGGTGGCAGCGAGGTCACCAGCGACTCCAGCGGCTCCGACACACCGGACGACGACGGACGGCAAGAACACGCCCCCGAATGCAGAATATCACCGGCCACAGAGCGCGCGGCCTGCAGATGGACTCTCACAGGGAGAGGAGTCCGTCTGAGGGCAGACCTGGCCAGCCCCGAGGACGTGACCCTGGACACGGACAGGCATGTCGGAACATCCGTTTGA

Protein sequence:

>DPOGS200275-PA
MSGEGERASRPNSLLLETPGPEREPLRFDVQLVGAPPEVEQLVNNIKQVSEDFLYHWKTFPIVLPPSRFTGPDNRPSDIIIAPPCDELDAAALDAGIEPHPLTPKQLHAIREKGLHKDKIYMPRKLNAKQLESICEWGEFEVPSRHFPGQTHVWRVAGWLQRGRARAKEELYCHVARAVALIVVVSRDRLLRDEPLSLIAGGKAFMEGLQRLIDLIFGLPSLEARDLEKKIREERSRYLVAELICKNENQEDIAALAAWVSRAMRRAASEKTDGREPRAIPRVPYIYTTPQRTQVDLRLYKRDLLRRAASSLQTILERESRGWFLHFRERKAALLASQKMPIKDIEEEVSTAAMKEAEEGVRRKLEAGLSSAEARIIAAHPILSRADTWKKEKLAAAAHGLRNELRWTAMEDAANAMQVHKLHQHRYFLLRDLAFLKDREPLLMKELRAAKTPTREFTWATRIWFPDNWTIIRHFRGRSERIPTVISGRATNIVTPRSDPSQPVFLADREKIRTTTTRWPFWRLLNLAHRSWCWTWNMMFVLGVLIPWCSPLSLRTLLCVKPFVPDLELSQVNGTLFPKRSSETQTMWSQLLKLWRHVSKERTRFETEPDTGLLGKGMSRQANRIWNYGVIGGCGSLALLLLFPLVSLAASLLSLAVAASVPVWMPPLAVALHAVNALVFDLDCPDPPRLNRAAQALAAVCARGELESLATWATEVEAAIERPLRDYAHFVDACFGPFSVQIAKTGAYKQLEKECSELVWSLREKVATRKRELSLGLSDAARARVRMQAQDLRVDKSLKCNVRDVGCDIHEVKKRKTDLKISHTGFCLAEEGVRRKLEAGLSSAEARIIAAHPILSRADTWKKEKLAAAAHGLRNELRWTAMEDAANAMQVHKLHQHRYFLLRDLAFLKDREPLLMKELRAAKTPTREFTWATRIWFPDNWTIIRHFRGRSERIPTVISGRATNIVTPRSDPSQPVFLADREKIRTTTTRWPFWRLLNLAHRSWCWTWNMMFVLGVLIPWCSPLSLRTLLCVKPFVPDLELSQILELSSEHRLLGKGMSRQANRIWNYGVIGGCGSLALLLLFPLVSLAASLLSLAVAASVPVWMPPLAVALHAVNALVFDLDCPDPPRLNRWFVLFEVLIWRIGLLGVLQPIVAFIVAVFVCPLTALIMLVLCVSWWCLRGVWEAVSWRVLLVRGARVPAHDSRFCRRVAGPALLTHASYQITAAQALAAVCARGELESLATWATEVEAAIERPLRDYAHFVDACFGPFSVQIAKTGAYKQLEKECSELVWSLREKVATRKRELSLGLSDAARARVRMQAQDLRKAVHLSAVELSRVWGEAASRSDDWWTARGLEPADWHALAANMLVEVFDAEILVALEEGEARVSLEAGPGAARWGRTARDHAPPDVLAERDDWGPAPGVWGEWSGTPAPRVPPPSLEVSAFSPRTPALPPLPPPAAVALVLHNRESDNPIQLDSELCTEILKTLEDAPDCDDRRDDVERYRGGGSEVTSDSSGSDTPDDDGRQEHAPECRISPATERAACRWTLTGRGVRLRADLASPEDVTLDTDRHVGTSV-