Monarch geneset OGS2.0

DPOGS209264
Transcript: DPOGS209264-TA (1482 bp)
Protein: DPOGS209264-PA (493 aa)
Genomic position: DPSCF300111 + 455456-460987
RNAseq coverage: 445x (Rank: top 28%)
Annotation
Database          Hit                 E-value   Identity   Description
Heliconius        HMEL016740          0.0       79.84%
Bombyx            BGIBMGA007044-TA    2e-90     59.69%
Drosophila        CG16953-PB          1e-39     43.08%
EBI UniRef50      UniRef50_D6WDQ4     4e-79     40.77%     Putative uncharacterized protein n=2 Tax=Tribolium castaneum RepID=D6WDQ4_TRICA
NCBI RefSeq       XP_970651.1         3e-63     43.78%     PREDICTED: similar to GA14233-PA [Tribolium castaneum]
NCBI nr blastp    gi|270003488        1e-78     40.77%     hypothetical protein TcasGA2_TC002731 [Tribolium castaneum]
NCBI nr blastx    gi|270003488        2e-102    43.33%     hypothetical protein TcasGA2_TC002731 [Tribolium castaneum]
Group
KEGG pathway 
Orthology group: MCL15844 (Insect specific)
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS209264-TA
ATGGCTCGTTTGTTACTGTTGCTATTTTGTAGTTTTATATTTGAAAAAGCTGTTTCTGCTCTGAAACTGTCGATAATAAGCGATGGTCCAGCAGTGCGTGGGTCTAATCTTACATTTACAGCCACTGTCGCTGGCTATCATGGCGAGAATCTTAAATATGTGTATTGGGATGATGCACAGCCCCAGCATACAGCGGAGTTTATTTTATCATCAAACACATCACAATGGATAGTTGAGTATCCAAGGTCAATGTATGAAGCTGGTGATTATACTGTCAAAGTTAATTTAATGAAAAGTTTCTATGGAATATGGTATCTAGTTACAACTTCAAGTACCAACTTTCAACTCACTGATTCACTTAATGGCAATTTGATTCTTAATCAGAACAATTTGGACAGGCCCAATAAGTTTGTAGCTGTCAATAAGACTGTGCATCATTACATACAGCTGCCAGAGAATGAGCTTGAGTTTCTTAAAAGAAATGCCTCTACAGTTGTAACATACTGGTTCATAGATTGCTTGTATATTGATCAGACAACAGATTTTTCTCTGAACTACACATACCAAGATGTGATGGCAGATCATTATATAGATGCTTTGGTTGTCGCCAACAATGAGCCGCTGCCACCAATTACCACTACTACTACAACCACCACAACAACTACTACCTCTACCACAACAACTACAACCACAACCACAACCACGACCACAACAACTCCAAAGCCACTTACATCAAGTACAATTCCTTCAACTACACCTGGCAATTTTAGCAATGATACACATAAAATCGCAAAGAGGTCTCTATCAAACAAGGCAAAGATAAGCCCCATCAAAGATTTTATTTGCCATAACACAAGTATAGTCCCTATTGGCGAGACTTACACATATGGACATTATCAGGAAACAATCAGTGTAAGAGAGCAGATATCAACACTTAATATAACTGGAATGAATTGGCTCCAACATGGTGATCTGTTGAACTTACAAGTGAAATACACTGGGACACCGCCCTTCGACTACTGTGCTATGTACAAAATGGGTCAATATAATGTAACAGGAAATGAGACATGTGCTGTCAAAACGAGGACCTTATCGAATTTGTTCCCACTAGTTCATTATTTCTCAGACAGTAACCAACATACAGTGGTTGTTGTTGTTGAAAATGAAATAGGCAAGGCTGTGTCTAGAGCAACCATCAATATTTATAAAGCGTCAGTCCACGCCCAGTTATCTGTAATAGTGGTGCCGGTGGTGTTCTGTCTGCTGGCTGTTATATTAGTGGTGTTTGGTATCGCGTTTTACCAACATCGCTCCAGATACACAGTGGAGGTGGCTGATTTTGATTTCGGACAACAAGTTAACTTTGACTATAAAACCTTCACTGAACGACTCAGGGACAGTTTCAGAAATGCTTTAAACTTCAGACAGCGAGATGACACCGAGCCACTTACATCCGACACAAGATATGATTCCATGCCATAA

Protein sequence:

>DPOGS209264-PA
MARLLLLLFCSFIFEKAVSALKLSIISDGPAVRGSNLTFTATVAGYHGENLKYVYWDDAQPQHTAEFILSSNTSQWIVEYPRSMYEAGDYTVKVNLMKSFYGIWYLVTTSSTNFQLTDSLNGNLILNQNNLDRPNKFVAVNKTVHHYIQLPENELEFLKRNASTVVTYWFIDCLYIDQTTDFSLNYTYQDVMADHYIDALVVANNEPLPPITTTTTTTTTTTTSTTTTTTTTTTTTTTTPKPLTSSTIPSTTPGNFSNDTHKIAKRSLSNKAKISPIKDFICHNTSIVPIGETYTYGHYQETISVREQISTLNITGMNWLQHGDLLNLQVKYTGTPPFDYCAMYKMGQYNVTGNETCAVKTRTLSNLFPLVHYFSDSNQHTVVVVVENEIGKAVSRATINIYKASVHAQLSVIVVPVVFCLLAVILVVFGIAFYQHRSRYTVEVADFDFGQQVNFDYKTFTERLRDSFRNALNFRQRDDTEPLTSDTRYDSMP-