Monarch geneset OGS2.0

DPOGS206814
TranscriptDPOGS206814-TA1704 bp
ProteinDPOGS206814-PA567 aa
Genomic positionDPSCF300001 - 3788497-3799669
RNAseq coverage237x (Rank: top 43%)
Annotation
HeliconiusHMEL0176092e-5229.45% 
BombyxBGIBMGA013102-TA1e-14148.13% 
DrosophilaCG4576-PA2e-3122.95% 
EBI UniRef50UniRef50_Q16FG43e-3924.16%Putative uncharacterized protein n=5 Tax=Culicinae RepID=Q16FG4_AEDAE
NCBI RefSeqXP_001649634.13e-4124.25%hypothetical protein AaeL_AAEL014801 [Aedes aegypti]
NCBI nr blastpgi|1571071255e-4024.25%hypothetical protein AaeL_AAEL014801 [Aedes aegypti]
NCBI nr blastxgi|1571071253e-4024.08%hypothetical protein AaeL_AAEL014801 [Aedes aegypti]
Group
KEGG pathwaydme:Dmel_CG333379e-07 
 K00680 (E2.3.1.-)maps-> Benzoate degradation via CoA ligation
    Limonene and pinene degradation
    Ethylbenzene degradation
    Tyrosine metabolism
    1- and 2-Methylnaphthalene degradation
Orthology groupMCL30983 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS206814-TA
ATGGTGCCGGTGCTGATGCTAGTACTTGGAGTAGTCGCGGCTACCGACAGAGACTTCACTGATGTAGAGCAACTACGGCTACCACGTCTGTTTCACTTGGATAATTACGAGCGATGCCTGGCGCAGCGCGGTGGACAGTACTGCCTTGGCAGCTTCCGCGTGAGTGCAGACCAGGAAAACCCAAAATATGAATTCATTAAGGAATTCTCCTCCAGTCCACAAAACTTCGACCGCACTCTGCTACATCGCGGGTATTGTGTATCTTCTCGTTGCCCATCGGAAGCTAACATCACGCTTCGGTTTGAGCGCTGCATACAGCAACACGTCCTGCCTCCTGGCTTCACTGCAGCCTTACAAACTTATTCATGCGAAACAAAAAAAGAAGTAGATTTCAAAATGGATATTCCACAGTTTGCGTTTCTGCTTGTGATTGGGATTATTTTATTTTTGAACTTGATGGGCACGCTGTACGATTTGTTAAAAGGAGGAGAAGCAAAGAGCAAGTTACTGATGTCATGGTCATTACGAGTGAATTGGCAACGTCTCACAAGCACTCATGATGATGGAGACCCTCGATTGACAGCTTTAGCGCCCATACAGGGTGTCAGAGTTCTTCTGCTAATACTTGTGATGATGACCCACGCTTCTGAAATACAACATAAGGTCTACCTATATAATCCTGAGTTTTTTGAGAAGGTGCTGACTTACCCCATCACAATGTTAATCAAGAACGGTTCGTCGATAACTCAGATATTCATCGTGTTATCAAACTTTCTTTTCGGGTATAGCCTTTTAATATACTCCAAAAACAAGCAACTAGGGTTATCCCAGCTGCCCGCCTGTATCATGCATCGAATAGCTAGGATAACTCCAATCCATATGTTAGTAGTAGGGTTCGCTGCAACATGGTGGCAAGAGTCGGGTTCTGGACCGCAGTGGGCCGCCACCATCGGCGCAGAAAGCCAAATCTGCCGCAAGAAGTTCTGGACTCATTTTTTCTTCCTTCATAATTTTATATACAAAGACGAACATTGCTTACTCCAAACATGGTTTTTAGCAGTTGATATGCAAGTGTATTTTGTGGCATCAGCACTTATGCTGTATATGATACAGAAAAAGAAGAATCGAATACAGATATTGACCTGCTTATTTATTCTGTCTTGCCTCTTAAATGCAGGACTTGCATATATAAATGACTGGAAGTCGCTTCTATACATCATGTTACCGGAGAACGTGCGCACCACCTTCCACGGAATCCCATCGTTTAGTCAATACTATATCTCTCCCTGGGGAAGCTTGCCATCCTGTTTCATAGGTCTCATCACAGCCTGTGTACACTTCGATATGCAGGAACACGGATACAAGATAGCCAAGCAGAGATGGTTCACGACGCTCTACCACTTATCCATTCCTCTTATCGTGTTGTGTTTGTTGGCTGGAAACGTGATGTTGCGTCACACATCTCGCGGAGCAGTTTCCTCTTTCCTTGCCGCTGAACGACCCACAGTCGCCTTTCTTGCTGCCATATGTATTCTGGGCATTGCCAACAATGTAGATAGCGGCGGACCTGTTATCGACCATATTGTGGACGTACTGCGCTGCGCTACCACTTACTCTTCTGGTGGAAGCGCCCCTGCAGCGAACGTTCAATTCGCTGCTTTCCTAATATCTTACGCTGTGATAATTCACTTGTGGACTGATTAG

Protein sequence:

>DPOGS206814-PA
MVPVLMLVLGVVAATDRDFTDVEQLRLPRLFHLDNYERCLAQRGGQYCLGSFRVSADQENPKYEFIKEFSSSPQNFDRTLLHRGYCVSSRCPSEANITLRFERCIQQHVLPPGFTAALQTYSCETKKEVDFKMDIPQFAFLLVIGIILFLNLMGTLYDLLKGGEAKSKLLMSWSLRVNWQRLTSTHDDGDPRLTALAPIQGVRVLLLILVMMTHASEIQHKVYLYNPEFFEKVLTYPITMLIKNGSSITQIFIVLSNFLFGYSLLIYSKNKQLGLSQLPACIMHRIARITPIHMLVVGFAATWWQESGSGPQWAATIGAESQICRKKFWTHFFFLHNFIYKDEHCLLQTWFLAVDMQVYFVASALMLYMIQKKKNRIQILTCLFILSCLLNAGLAYINDWKSLLYIMLPENVRTTFHGIPSFSQYYISPWGSLPSCFIGLITACVHFDMQEHGYKIAKQRWFTTLYHLSIPLIVLCLLAGNVMLRHTSRGAVSSFLAAERPTVAFLAAICILGIANNVDSGGPVIDHIVDVLRCATTYSSGGSAPAANVQFAAFLISYAVIIHLWTD-