Monarch geneset OGS2.0

DPOGS204260
TranscriptDPOGS204260-TA1320 bp
ProteinDPOGS204260-PA439 aa
Genomic positionDPSCF300046 - 263443-266167
RNAseq coverage48x (Rank: top 71%)
Annotation
HeliconiusHMEL0151931e-7751.96% 
BombyxBGIBMGA007533-TA2e-5640.84% 
Drosophila% 
EBI UniRef50UniRef50_UPI0001CBAE032e-1328.19%UPI0001CBAE03 related cluster n=1 Tax=unknown RepID=UPI0001CBAE03
NCBI RefSeqXP_002736948.14e-1428.19%PREDICTED: glutamine rich 2-like [Saccoglossus kowalevskii]
NCBI nr blastpgi|2912340158e-1328.19%PREDICTED: glutamine rich 2-like [Saccoglossus kowalevskii]
NCBI nr blastxgi|3838607302e-1321.97%PREDICTED: uncharacterized protein LOC100883897 [Megachile rotundata]
Group
KEGG pathway 
Orthology groupMCL27811 Specific divergent
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS204260-TA
ATGATAGACCTGGCGATCGGAACACCAGAGGTGGGTGTTGTGGATTTTGGCATGCTGCAAACTATTCTACACTGTTTGGCTCAACAATTGAAAGTGTTGGGTAAAAATGTTGAGTTAAGAGGCAGCGTGGCCATGCTGCCAGTGGGACGGGAGAATGTCCATACTATTTCTATCAGCGAATACACTGTGGAGGCAGATGATAAGGTACCTAAAGAGAAAAAGACAGAGCTTAGCCGTACATCGAGTGTGACGGATACAATATTGGTCGTGGAGCGAAAAAGAAAGTCAGAAACAAGTAGGCCGACTGTGACCCAAACTAGTCGTAGTCCCACTCGAGGTCCGTCTCCACCCCTGCCAGCCGCGATGCCCCAGGACCTTGCTGCTGAGAAGCTTTCCCTCGTCACTACCAGCAAGTTCAATATTCTGGAGAACACTGTAAACGACCTTAAGGATCGTGTGTATGGAAGTATGCCTAAGAATGAGGAAATATTGGAGGAAGTTAGATCCCAAACAAATCTCAAAGCCATCACAGACATGTGGACGTCGCTGAACGTATCCTCTCGCTTGGAAGCAGCCGAGGCTGGTCTCGCTAAACTGAGCTCGCTCGTTCAGGATATCATATCCGAAAACGCTGGGCTTCAGGAGCGAATTGATGCGTCGCCGTCTGTAGCACCATCAACAATGGCTCCACCACCGAGGGTTTCGATAGCACCGCCACCAGCGCCAGTAGTATCCACAGTAGCTCCACCAACTGCTGCCCCCTCATCAATTGCCCCGCCATCTACAGCTCCACCGACGGCTGCACCGTCTATGGTGGGCGGTCCCCAGCCATCCTTTGATACATTAGCCACCAAGGATGACCTAAAGCAGCTCCAGGCTCATTTAGAACGAATCCGTATGGACCTCGACAACGTCTCTAACACTCTGTACAGCGCGATGGCCGACATGCAAGCAGGACTGGGCGAAGAGCCGGTTCCTCCTCCTCCTTCTACTCCTCCTCCAACGACCGTGGTCTCTACTACCGAACCAGCCCCACCACTAGCACCAAAATCAGCACCGTCTATAGCGGGAAGACGAAAATCTGATTTCCCACGTCGTTCTTTCTCACAAATAACTGGCAATCTGATGGAGAGAATAAACGATTTAGAAAAACGCATAAACAATTGCTGTCAGGATATAGGAAAAAATGATGGAGTTCTTCAAGATCAGATCAACTCGTTCCAAGATCAATTGGACTTCCTGCACAAAGAAGTGGGACAACTGCGAAGCGTGCAATCAGAAACATTATCACCAGATCTTGTTAAAGATTTGCAAGGCTAG

Protein sequence:

>DPOGS204260-PA
MIDLAIGTPEVGVVDFGMLQTILHCLAQQLKVLGKNVELRGSVAMLPVGRENVHTISISEYTVEADDKVPKEKKTELSRTSSVTDTILVVERKRKSETSRPTVTQTSRSPTRGPSPPLPAAMPQDLAAEKLSLVTTSKFNILENTVNDLKDRVYGSMPKNEEILEEVRSQTNLKAITDMWTSLNVSSRLEAAEAGLAKLSSLVQDIISENAGLQERIDASPSVAPSTMAPPPRVSIAPPPAPVVSTVAPPTAAPSSIAPPSTAPPTAAPSMVGGPQPSFDTLATKDDLKQLQAHLERIRMDLDNVSNTLYSAMADMQAGLGEEPVPPPPSTPPPTTVVSTTEPAPPLAPKSAPSIAGRRKSDFPRRSFSQITGNLMERINDLEKRINNCCQDIGKNDGVLQDQINSFQDQLDFLHKEVGQLRSVQSETLSPDLVKDLQG-