Monarch geneset OGS2.0

DPOGS203437
TranscriptDPOGS203437-TA3666 bp
ProteinDPOGS203437-PA1221 aa
Genomic positionDPSCF300242 - 202206-208491
RNAseq coverage298x (Rank: top 37%)
Annotation
HeliconiusHMEL0150243e-14236.54% 
BombyxBGIBMGA011156-TA2e-2649.21% 
DrosophilaCG16742-PA5e-0738.46% 
EBI UniRef50UniRef50_F0ZDD54e-1037.19%Putative uncharacterized protein n=1 Tax=Dictyostelium purpureum RepID=F0ZDD5_DICPU
NCBI RefSeqXP_969485.12e-1226.39%PREDICTED: similar to Protein FAM21A [Tribolium castaneum]
NCBI nr blastpgi|3800273341e-0941.67%PREDICTED: uncharacterized protein LOC100869481 [Apis florea]
NCBI nr blastxgi|2420202741e-3721.74%Tanabin, putative [Pediculus humanus corporis]
Group
KEGG pathway 
Orthology groupMCL21908 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS203437-TA
ATGGAGGGAGATACTACTCGTCTGAGGTTGTCGGCTCCCGACTGGTCTCTCGCAGGAGACTCACAGTTACTGGACATCTTACAGAGCTTGCATCAGACCATCATAACCAAATGCCAGGAGACCAATGTTCAGTTGGAGTCCATGATGTCGTCCCTGGATGAGGCCAGCATTCACTTGCAGAATGTTAACAACAAATTCCTCGGACTCAGTAACAGTCAGTTTGTCGAAAGTCGCGTGTACGATGATCACACGGAGATCGCTGAAGATAATAACAACAAGGATCCTCCGCAGCGCGCCCCCCTCAGTCCCGTGTCGTCTTTGAAGCTCTGTCTCCACACCCTGGAGAGTCTTCACGAAGCTGTCCCCGTGATGGACTCCGACAGTGACGAAGAGGGCAGCGCCCGCGTGGTGCTCCGGCCTCTGCTGCGTCGCGGGCGCGTCGCTCATGAAGCTGTATCATCAGACGCGGACAGCGAGGAGTCCTCCCAGCAGGAGAGGCAGCTGGAGGCGGAGTACTCGGACTCGAGTTCTGAACACGAACAACAGACGGACGCACACGAACATACTATTCCACCGCCGCCTTCCGTGTTAGTGACGTCACACACGCCGCCCGACACGAGGACCACGGAGCCGGTCACCTCGCCCGAGTCGAATGTATCGCCAAAAGTGAGGAAGCTGTACACAGTAGACAAACCCGTCACAGCTCAAATCTTCCCCGAGGAGCCTCCGCCTCTTGACAAGTATGACTCCGACACTGACGATGACATCTTCGCTGACTTACACACACATGCACATACACATACACACACCCACACACACACAGCGCCAGACACGGGCGACATCGTAAACGACCTGTTCGGAGGAGGAGGAGGGGGAGGAAGAGCAGGGTTTGACAGAGATGACGTCACAGAACACACGCGAGTGAGGCATTCACACTTTGTGAGAGAGGAGTCGCCTGGAGCGACCAGTGTGGAGCCAGTGGAGCCAGAGTCAGAACAGACTCCGCCACGGGAATATACTACAAAGGAAAATGTTAAAAAACCCGCTGGTGGTATATCTCTGTTCGGCGGCGCGGGTCCTGAAGCTATCGGAGCGGCCGTCCTGCGAAGAGCACGAAGACAGTCATCAAGTGACGGTGAGGTCGCGGACACTCGCACCGACAGAACCAATGTCATCGACGAATTATTTATAAAACCAACTAAAAATGTCAAAAAACCACCCGTCGATGTTAAGAAAGAACCGAAAGTTGCTAAAGATATAGCTGAGAGTAGCGCTAAAGATAAAAAAGATAAAATAGATCTGTTCTCTGATGATATCTTTGATGACATCGATGATATCTTTACGAGTAACGTTACGAACACGACAAAAGACAGCAAGGAAACGTTGTTTAATGATGATCTGTTCAATGATAACAATGATCTGTTTAACGATAACAGTAAGTCTGTTAAGATTGAGAGCAGCGTTACTAAAGACGACAAAGTAAGAAACATATTTGATAGTGACAGTGAAGACGATTTGTTCTTTGATGCTAAAGGAAAAGATAAAGATTCAGACACAAAAGATAGCACTAAAGTTAAAGATTATAATTCAAATGAAAGCTTAACAGTCAAGAACACTAAAGAAGAAAGTAAAGTTGAACTGAAAAATCAGTTGAGTCCCAATTTATTTGATGATGATGATGATGACCTGTTCAATGTGACGCCGTCCAGGAGAGTGGCGAGTGAACACGGTGATAGGAACGCTGAAGAAACACGAGATAATCAAAGACAAGACAAGAATGAGGCCGAAAAGATGGAAGGAATCAAGACAAGTGACACGCAGGGGGAAGACTGTTTGGAAGAAAAACATGTCGGTGATCCCGTGACAACTGAAAGAAGTGATGCAAACATGCGCGTTCCAGAAAAAAACGTTGTACGAAACGAATTTCACGACGATTTTAATGATTCTGGACCAATAGAGGAAGATTCGGCAAAATCTACTGATAGAGAAGATAATGATGAGAAATCAAAAGGTGATAATAGTCTGCCGAAAGAAAAAGACTTTATAAAAGAAACGAAAGAAAATAAAAGTGAGGAGGAAGCGATAGATGTAAAAGATACTAACGCCAATGACATATTCGTCGACATCTTCAGTGATCTGCCTCCAGCCTTCGAGAAACCGATTGAACCGAAGAAGAGTAAAAACGTCAATGCTCTGTTCGACGATGACTCTGATGATGAGGCGCTGTTCTTCAAGAAAGATGACGTCATCACCGACGAGAAACCGGAAATGGACTTCGGCAGTGACAGGTTTAGAATATTCCATGACGAACCACCCGATATTGATGTGGATTTCACAACGAAGTCTGCGAGCGGACCTCATACGACTGATGTGGCAGATGCTTTGGAAGCTGCGGCGGACGTTGAAGCTGTGACCCATGGAAAAGCTGACACGGCATCAGAAAAACAGATTGAAAAATGTAAAGAGACGGGAAACGAAAATATGCCTCATGAAACAAAAAACAACAACAAAATAAACATACTGAAATTACTAGAGAATGAGGAAAACAATACTGATGGAGGAAATGAAAAGAAAGAGGATTTATTTACCGGCACAGAAAAAGATGATGCGAACTCTGCGAGTAAAACAAAAACAAGAGATGTCAAGACAGAAGAAGAATCAGACTCCTCGGAAAGAGAGAATAGAGTTATTGGAAAGCTGAAGCCGACGAAGCTCAATATAAATGTTAATACGTTGTTACCGGGAGCTGTTCCGAAGAAACCTGTGAACTACGAAGAGACCGACGGACAGGTCACATCCAGAAGTAAAGAAGACTCCGCTCTGGTTGAAGAGCACAAAGAAAAAGTAGTCAGCTTCAAGGAAGAAACGAACTCGGAAGTCCTAGATAACAAACTATCCAAGGAGAGAGCTCGGATTCAGGTCAAAAGACGACCGTCGACTAGACGAGCTAGACTTGAAGCTGTGAGGAAGACTGGTCTAGACTTCGGGTCAGACTCCACAGACAACTCCAGCTCGTTTGACGAACCGGTCAGAGAGATACCAAGAGACAGCGCTCCTAACAAAGAAACAACGACGAAAGTGACCAAACAAGCAGACAACAAAGATGTCATCTCTAAAGTTGTTTATGTTCTGAACGACGAGGACATCTTCGACATTCCTCCGACAGAAACAACTGCTGGAAAACCTCGGAAAGAAGATCTCACGGAAACAATGAACTCTACTGGAATCAGACACCAAGAAACACAAGGAGACGAGAGTCGGAAAAAGAAGACAGAAGAAAAGAAAACAAAAACATCATTATTTGATGATAGCGACGAGGAAACGGATCTGTTTGGGAAACACACTAAGAGATATATATTCGACTCGGACAGCGACAGCGAACTGTTCGGGAAAGATAAAGGAAAGATAGTGAAAGATACAAGAACAGAGGAAAAAGATAAAGAAAGGAGAATCGACAAGGTACAAGCGAAAATACCTCTGTTCAGTGACGACAGCGATGAAGACTTGTTCGGAGGAAAATCAAAAAAAATAGAAGTAAAGAACACATCACAAGCGAGAGCAGTCCCTGGATCATCACAAGTGAGAGCAGTCCCTGGATCATCACAAGCCTTCGATGATCCGCTCTCAGTGCTCGGGGACGAGCGCTCACACAACGTGCATATATAG

Protein sequence:

>DPOGS203437-PA
MEGDTTRLRLSAPDWSLAGDSQLLDILQSLHQTIITKCQETNVQLESMMSSLDEASIHLQNVNNKFLGLSNSQFVESRVYDDHTEIAEDNNNKDPPQRAPLSPVSSLKLCLHTLESLHEAVPVMDSDSDEEGSARVVLRPLLRRGRVAHEAVSSDADSEESSQQERQLEAEYSDSSSEHEQQTDAHEHTIPPPPSVLVTSHTPPDTRTTEPVTSPESNVSPKVRKLYTVDKPVTAQIFPEEPPPLDKYDSDTDDDIFADLHTHAHTHTHTHTHTAPDTGDIVNDLFGGGGGGGRAGFDRDDVTEHTRVRHSHFVREESPGATSVEPVEPESEQTPPREYTTKENVKKPAGGISLFGGAGPEAIGAAVLRRARRQSSSDGEVADTRTDRTNVIDELFIKPTKNVKKPPVDVKKEPKVAKDIAESSAKDKKDKIDLFSDDIFDDIDDIFTSNVTNTTKDSKETLFNDDLFNDNNDLFNDNSKSVKIESSVTKDDKVRNIFDSDSEDDLFFDAKGKDKDSDTKDSTKVKDYNSNESLTVKNTKEESKVELKNQLSPNLFDDDDDDLFNVTPSRRVASEHGDRNAEETRDNQRQDKNEAEKMEGIKTSDTQGEDCLEEKHVGDPVTTERSDANMRVPEKNVVRNEFHDDFNDSGPIEEDSAKSTDREDNDEKSKGDNSLPKEKDFIKETKENKSEEEAIDVKDTNANDIFVDIFSDLPPAFEKPIEPKKSKNVNALFDDDSDDEALFFKKDDVITDEKPEMDFGSDRFRIFHDEPPDIDVDFTTKSASGPHTTDVADALEAAADVEAVTHGKADTASEKQIEKCKETGNENMPHETKNNNKINILKLLENEENNTDGGNEKKEDLFTGTEKDDANSASKTKTRDVKTEEESDSSERENRVIGKLKPTKLNINVNTLLPGAVPKKPVNYEETDGQVTSRSKEDSALVEEHKEKVVSFKEETNSEVLDNKLSKERARIQVKRRPSTRRARLEAVRKTGLDFGSDSTDNSSSFDEPVREIPRDSAPNKETTTKVTKQADNKDVISKVVYVLNDEDIFDIPPTETTAGKPRKEDLTETMNSTGIRHQETQGDESRKKKTEEKKTKTSLFDDSDEETDLFGKHTKRYIFDSDSDSELFGKDKGKIVKDTRTEEKDKERRIDKVQAKIPLFSDDSDEDLFGGKSKKIEVKNTSQARAVPGSSQVRAVPGSSQAFDDPLSVLGDERSHNVHI-