Monarch geneset OGS2.0

DPOGS211433
TranscriptDPOGS211433-TA1893 bp
ProteinDPOGS211433-PA630 aa
Genomic positionDPSCF300115 + 730936-735094
RNAseq coverage114x (Rank: top 59%)
Annotation
HeliconiusHMEL0175077e-16246.86% 
BombyxBGIBMGA004621-TA3e-7531.40% 
DrosophilaCG9447-PB1e-5226.99% 
EBI UniRef50UniRef50_A7URR77e-6126.56%AGAP007070-PA n=5 Tax=Anopheles gambiae RepID=A7URR7_ANOGA
NCBI RefSeqXP_001688060.11e-6126.56%AGAP007070-PA [Anopheles gambiae str. PEST]
NCBI nr blastpgi|1582863433e-6026.56%AGAP007070-PA [Anopheles gambiae str. PEST]
NCBI nr blastxgi|1571365936e-6028.62%hypothetical protein AaeL_AAEL013595 [Aedes aegypti]
Group
KEGG pathwaydme:Dmel_CG333372e-09 
 K00680 (E2.3.1.-)maps-> Benzoate degradation via CoA ligation
    Limonene and pinene degradation
    Ethylbenzene degradation
    Tyrosine metabolism
    1- and 2-Methylnaphthalene degradation
Orthology groupMCL34888 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS211433-TA
ATGCCGCCTGTCTTCCACTTGGATGATTATGACGACTGTATGATAAACCCCCAAGGCTTGTACTGCTTTGTAGACATCAATTTGGTTTCCGACAGTCCCAATGAAGTCTTAGTTATGATACAAAGATATTCTGCGCAAACAAAGAAACACTTCAATCACTCCCAACTTCATCACGGTAAATGTGTTACACAAAAATGTAAAAAATACATACAAGATCTAAATGAAACCGATTTGATCAGCAACCATCACCTCACGACTGCCCTGGAGGGATGTTTCAATGAATCCCTATGGCAGGACTATAGACTCAAAACGAGAGTATCACACGTATTTTGCGTCGATAACAAAAAGGAAGTTGTTTTTGACGCTGGCGATATAGCGTTAGGCGTATTTATATTTATTGTTCTGGCGTTAAACGTGATTGGAAGTCTCTACGACGTCGTTTTGATAAAAGGCAATAAAAATCAAGTCAATAAATTTCAAATCAAATATAATTTATATTTTACACGGTTAAAGATAATTCTAATAGCTGTTTTCTTATCAAAAACTCAATGTTTAATTATTTTATTGCCAATTTTTAGAACCATAACGATGATGTGCGTTATTTATTGTCATTCACTCATGATGTTTGCTTTTGAACCTGAAAATCCTCATCATATCGAACAGCTTTATGACAACATTCTACATTATATATTCCTGAATGGATTCTTAGTTGTCCAGACATTCTTCATAATATCGGGATGTCTCATATCTTATAAACTGGAATTGTACGCAGAGAAGCATAAAGTAAAATGGAATTTGATACCATTAGCGATTTTAATGACTTGGATAAAACTAACTCCCTCCTACATGATTATATTAGCAATTTCTACAACATGGTTGAGGCACGCTGGAACCGGCCCTTTCTGGGAGATAAGTGTTGCCAAGGAGGTGGAGGATTGTCGACAGAATGGTTGGACAAACCTATTGTATATCAACAATTATATGGTCAATTCCCAATGCCTTGGACAGTCATGGTATTTAGGGGCCGCCATGCAGCTAAGTATAGTAGGATACATTGTTTGCGTATTACCAAACACCGTTAAGGTCAGGAACATTGCGCTAACGATTCTCTTCGTGATTGGTGTTATAACACCAGCAGTTCACACTTACTACCAAGACTTGGATGCTGTTCTCATGATTACACCTGAAACAGCTCGAGTATTTGAGGGGAACCCCACATTTAATTATGTATACAGAATGGGTCACACAAACATCACGAATTTCATCGTAGGCATTGTTTTAGGATATTCCATATATAGATGGCAGAAAACCGGAGAAACATTTCAAAGATATAAAAAACATCGTTATTTGCTTCTGTTGACTTTCCCATCTATTATTGGAATGATGTTCATCGGGAGCGTATTTTACTTGGATCATCTTAACGCCCCGGTACTCGTCAAAGCTATATATGCTGGTATTGTTAAGCCAATATATGGAATCATTATGTCGGTACTAATTATAGCAGCCGTTTTTAAATTAGAAAATGTTTATCGAGCTATTTTCGAGTCGAGTATTTGGAGGATACCAGCAAAACTGACGTACTGTGCCTACATAATACACGTAACGATAATACGCGCTGTCGTCGGGAATCGGACAACCTTGGCTACATTTTCAAACATAAATATGATTGAATTCTCATTAGCATGTATATCATTGACATTCATATTGTCAGTGCCGTTTTGGTTACTCATAAATACGCCTCTCACTCAGCTACTTAAGTCGTGTGTTAGTTGTTTGTTACAAATGAAAGAGGCTCATGGAACAGAAAAAGATTCTAGAACGGATAAGAACAGTATTATTCTTGAAGTAAAGAATGAAATTCAAAAGGAAGTTGCTGAAAACGTTGCTTCGTAA

Protein sequence:

>DPOGS211433-PA
MPPVFHLDDYDDCMINPQGLYCFVDINLVSDSPNEVLVMIQRYSAQTKKHFNHSQLHHGKCVTQKCKKYIQDLNETDLISNHHLTTALEGCFNESLWQDYRLKTRVSHVFCVDNKKEVVFDAGDIALGVFIFIVLALNVIGSLYDVVLIKGNKNQVNKFQIKYNLYFTRLKIILIAVFLSKTQCLIILLPIFRTITMMCVIYCHSLMMFAFEPENPHHIEQLYDNILHYIFLNGFLVVQTFFIISGCLISYKLELYAEKHKVKWNLIPLAILMTWIKLTPSYMIILAISTTWLRHAGTGPFWEISVAKEVEDCRQNGWTNLLYINNYMVNSQCLGQSWYLGAAMQLSIVGYIVCVLPNTVKVRNIALTILFVIGVITPAVHTYYQDLDAVLMITPETARVFEGNPTFNYVYRMGHTNITNFIVGIVLGYSIYRWQKTGETFQRYKKHRYLLLLTFPSIIGMMFIGSVFYLDHLNAPVLVKAIYAGIVKPIYGIIMSVLIIAAVFKLENVYRAIFESSIWRIPAKLTYCAYIIHVTIIRAVVGNRTTLATFSNINMIEFSLACISLTFILSVPFWLLINTPLTQLLKSCVSCLLQMKEAHGTEKDSRTDKNSIILEVKNEIQKEVAENVAS-