Monarch geneset OGS2.0

DPOGS207224
Transcript: DPOGS207224-TA, 1503 bp
Protein: DPOGS207224-PA, 500 aa
Genomic position: DPSCF300235 - 406472-415131
RNAseq coverage: 285x (Rank: top 39%)
Annotation
Heliconius: HMEL012653, 3e-90, 61.87%
Bombyx: BGIBMGA008555-TA, 6e-144, 56.90%
Drosophila: CG9009-PA, 7e-89, 37.09%
EBI UniRef50: UniRef50_Q9VXZ8, 1e-86, 37.09%, BcDNA.GH02901 n=12 Tax=Drosophila RepID=Q9VXZ8_DROME
NCBI RefSeq: NP_572988.1, 2e-87, 37.09%, CG9009 [Drosophila melanogaster]
NCBI nr blastp: gi|195566786, 5e-85, 36.69%, GD15826 [Drosophila simulans]
NCBI nr blastx: gi|270005448, 6e-83, 38.85%, hypothetical protein TcasGA2_TC007506 [Tribolium castaneum]
Group
Gene Ontology: GO:0008152, 3.8e-51, metabolic process
    GO:0003824, 3.8e-51, catalytic activity
KEGG pathway: tad:TRIADDRAFT_56202, 3e-81
    K01904 (E6.2.1.12) maps ->
        Phenylpropanoid biosynthesis
        Phenylalanine metabolism
        Ubiquinone and other terpenoid-quinone biosynthesis
InterPro domain: [263-430] IPR000873, 3.8e-51, AMP-dependent synthetase/ligase
Orthology group: MCL25503 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS207224-TA
ATGGCAACGGTCTTACATCGGAGGGCTCTACGAGTGGTTAATAAAGGAAAACTTTTCTTAGCCACATTATCCCGAGATAACAGCACAGAAAAACATATCTTGAGATCAGACTCTGGTGGTATAGAGAAGCCAAGACAAACTGTCACTGAATTTGTGTGGCAAAACTTAGATAAATGGCCTGACAAAACATTAGCTGTTTGTGCTGTAACTGGCCGCGGTTACACCTACGCTCAAACACATAGATTATCTGTTTCCTTTGCAGCATCATTACTTAAGAAACTCAAACTTCAACACAATGATAAGGTTGCCATTGTCTTACCAAATGTTCCCGAATATCCAGCCATCGCTTTTGGTATTTTGGAGGCTGGCTGTATCGCTAGCATGATGAATCCTGCTTACACAGTTGATGAACTCAAACATCAAATAAAACTCGTCGAGTGTAAGGCAATAGTAGCATCCAAATTATCGTATCCAAATTTGTATAAAGCACTGCAAGAACTAAAAATGAACATACCTGTGATATTAATTGACAATGAAGATCTACCCGAAAATACTATAAAGTTTGCTGAACTCGCTGAAAACACAGACACGGATATATTGAAATCGGTAAAACGAAACATCAAAGACACAGCCATCCTGCCATTTTCCAGTGGAACAACTGGTTTCCCCAAAGCCGTTGAACTGACCCATGAAAGTATATGCGCTCTTAATAGCATGATATTGACTCCAGGAATTATAGCTGTCCAAGAAGCTACAGCGATCTTCCTGGGGAAGCACCCGGCTGTTACACCGCGGCACTTGGACTCCGTCATCGACATTATCTGTGGCGCCGCCTCCCTCTCTAGTGGAGACGCTATGGCTATTATTGAAAAGAATAAGAATTTAATCTTCCGTCAAGGCTATGGCCTTACTGAGACAAACGGTGGCGTGGCCATCGGTTATAACGACAATACAAATCACGATGCTGTAGGATTCCCTTTCCCGAGCAGCGAAATAAAGATAGCTGATCTGAGTACCCAACAAGCTTTAGGACCGGGACAGGAAGGAGAAATTTGGTACAGGGGTCTTAACGTAATGAAGGGTTATTACAAGAATGAAGCAGCGACCAAAGAGGTCCTTACAGAAGACGGCTGGTTCAAAACTGGTGACGTCGGAAAATACGATGAAAACAAATATTTGTATATTACTGACAGAATAAAGGAACTCATTAAGGTTAAAGGCTTCCAAGTGGCACCAGCGGAACTGGAAACGGTTCTTCGTAGTCATCCAAAGATCCTCGATTGTGCTGTTCTTGGTATCCCAGACCCTTTTTCCGGGGAAGTCCCCAAAGCATTCGTCGTCGTCCAACCAGGACAGAACATTAAGGGAGAGGAAGTTCTGGAACACGTTAACAGTAAATTGACACAGTTCAAGAAAATTAAGGAAGTCCAATTCGTTGACGCGATACCCAAAAACCCAGCTGGGAAAATAATGAGGAGACAATTGAAAGAGAAATATTGTTAG

Protein sequence:

>DPOGS207224-PA
MATVLHRRALRVVNKGKLFLATLSRDNSTEKHILRSDSGGIEKPRQTVTEFVWQNLDKWPDKTLAVCAVTGRGYTYAQTHRLSVSFAASLLKKLKLQHNDKVAIVLPNVPEYPAIAFGILEAGCIASMMNPAYTVDELKHQIKLVECKAIVASKLSYPNLYKALQELKMNIPVILIDNEDLPENTIKFAELAENTDTDILKSVKRNIKDTAILPFSSGTTGFPKAVELTHESICALNSMILTPGIIAVQEATAIFLGKHPAVTPRHLDSVIDIICGAASLSSGDAMAIIEKNKNLIFRQGYGLTETNGGVAIGYNDNTNHDAVGFPFPSSEIKIADLSTQQALGPGQEGEIWYRGLNVMKGYYKNEAATKEVLTEDGWFKTGDVGKYDENKYLYITDRIKELIKVKGFQVAPAELETVLRSHPKILDCAVLGIPDPFSGEVPKAFVVVQPGQNIKGEEVLEHVNSKLTQFKKIKEVQFVDAIPKNPAGKIMRRQLKEKYC-
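
The transcript and protein lengths listed in this record (1503 bp and 500 aa) can be cross-checked with a small sketch like the one below. It is not part of the original record: the helper names `cds_protein_length` and `looks_like_complete_cds` are hypothetical, and it assumes the standard genetic code and that the transcript is a complete coding sequence ending in a stop codon (shown as the trailing `-` in the protein sequence).

```python
# Hypothetical helpers for illustration only; not part of any database or library.
STOP_CODONS = {"TAA", "TAG", "TGA"}  # standard genetic code (assumed)


def cds_protein_length(cds_length_bp: int) -> int:
    """Return the number of amino acids encoded by a complete CDS
    whose length includes the terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon.
    return cds_length_bp // 3 - 1


def looks_like_complete_cds(seq: str) -> bool:
    """Check that a nucleotide sequence is in frame, starts with ATG,
    and ends with a stop codon."""
    seq = seq.upper()
    return (
        len(seq) % 3 == 0
        and seq.startswith("ATG")
        and seq[-3:] in STOP_CODONS
    )


if __name__ == "__main__":
    # 1503 bp is the transcript length given in the record.
    print(cds_protein_length(1503))
    # Toy sequence built from the first three codons of the transcript plus its stop codon.
    print(looks_like_complete_cds("ATGGCAACGTAG"))
```

Applied to the full DPOGS207224-TA sequence, the same checks could flag a truncated or frame-shifted record before any downstream analysis.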