Monarch geneset OGS2.0

DPOGS207225
TranscriptDPOGS207225-TA1671 bp
ProteinDPOGS207225-PA556 aa
Genomic positionDPSCF300235 - 389354-395830
RNAseq coverage152x (Rank: top 53%)
Annotation
HeliconiusHMEL0126507e-11358.79% 
BombyxBGIBMGA008556-TA1e-11842.61% 
DrosophilaCG9009-PA5e-10238.20% 
EBI UniRef50UniRef50_Q7QCM81e-11041.12%AGAP002718-PA n=4 Tax=Endopterygota RepID=Q7QCM8_ANOGA
NCBI RefSeqXP_001656463.11e-11441.38%AMP dependent coa ligase [Aedes aegypti]
NCBI nr blastpgi|1571348293e-11341.38%AMP dependent coa ligase [Aedes aegypti]
NCBI nr blastxgi|1571348292e-11140.88%AMP dependent coa ligase [Aedes aegypti]
Group
Gene OntologyGO:00081521.1e-95metabolic process
GO:00038241.1e-95catalytic activity
KEGG pathwaytad:TRIADDRAFT_562024e-86 
 K01904 (E6.2.1.12)maps-> Phenylpropanoid biosynthesis
    Phenylalanine metabolism
    Ubiquinone and other terpenoid-quinone biosynthesis
InterPro domain[50-486] IPR0008731.1e-95AMP-dependent synthetase/ligase
Orthology group 
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS207225-TA
ATGCCGAACATGAGGTTAATGGAGCGTTTGTGGTTATTTTCGTCATCATTTCAGGACCATGTTGCTATTAGAGGTAACAACAAGTTGAAGACAGACTTTCGTTGCAACATAAAACTCAGGGAATGTGCAGTTACTAAGAAAAAATATACATACAAAAAAATAATTAAAAATACTGCAGTATTTGCGACGTCACTTCGAAAGAAGCTAGGCCTAAATCCTAATGATATTGTTATTTCATTATTACCAAACATACCTGAATATCCTGTTGTGGCTCTGGGAACTATACAAGCAGGATGTATTTTCAGCTCCGTTAATCCAATTTATAAAGAAGTTGAAATATGTCATCAAGTAAGCATTACAGAGCCAAAATTAGTCGTTACAATACCAGAATGTTACGATACTGTAGTGAAAGGTCTAAAAATGGCTAAGAGTCCAGCCAAAATTGTTCTTATTGATAGCCCTAACAAACCAGTACCAGAGGGCACTATAAGATATACTGAAATTGCTGAAACTGATGGCGTCGATTATGCTTTGCTTGACGGAGTGGAGAAAAACTTAGAAGATGTTGCTTTGATACCTTTTTCCAGTGGCACCACTGGACTTCCAAAGGGGGTTGAAATCACCTACAAAAACCTTATCGCTGCACTTGAAGTAATGGCGAAAGAAGAAAATTGCTTTCCCATTCTAACTAATGGTAATCAGCAAGATGTAGTGCCATGCATCCTTCCATTCTTCCATATATACGGGATGGTTGTAACTTTACTCGGACATTTTGTCAAGGGCTGTAAATTGATTACCCTACCGAGGTTTTCCGCAAACACCTATTTTGATGTGTTGAAGAATCAAAATCCCACGATATTGTACGTAGTTCCTCCTGTTGCAATACTCCTTGGAAAACATCCTGAGGTAACAAAAGATCACTTGAAACACGTTAAGTACATGGTATGCGGGGCGGCACCTCTATCTGCTTCCGATGCAAATGCGGTATTAGAAAAGAGCAATGTATTTCTTAATGATAAGAACTATACTGGAAATTTTCAGGGAAAATTAGAATTTAAACAAGGTTATGGAGCAACAGAAACTACATCGCTAACAACCTCAACTCTTATAGGAGCCACTGATATAGACTACTCATCCTGCGGGATGCCACTTGCACATACAGAGATCATGTTTCTTGGCAGTGATGGCAAGCCTGTTCCTATTGGCGAGCCTGGAGAACTTTGTACAAGGTCTCCAACTGTAATGAAAGGTTATTACAAAAATGAAAAAGCCACAAAAGAAAGTATGACAGACGACGGATTTTTCAAAACTGGTGATTTGGGACATTATGACCCCAAGTATGGCTTATATGTGACTGATAGAATAAAAGAACTCATAAAGGTTAAAGGAATGCAAGTAGCTCCGGCGGAATTGGAGGGCCTTTTAAGGTCTCATCCCGCAGTAGCTGATGCTGCGGTGATTGGTGTTCCACACGAGTATTTTGGTGAAGCACCAAAAGCTTTTATAATAAGGAAAGGTGGTCAAAATACATCACCGGAAGAACTGCAAGACTTTATTGCCAATAAAGTTGCGTCATTCAAGAAAATCGAAGAAGTGGTCTTTGTAGACGACATCCCAAAGACTACTTCTGGGAAAATATTAAGGAAGGAACTAAAAAAAATGTATGCGTAG

Protein sequence:

>DPOGS207225-PA
MPNMRLMERLWLFSSSFQDHVAIRGNNKLKTDFRCNIKLRECAVTKKKYTYKKIIKNTAVFATSLRKKLGLNPNDIVISLLPNIPEYPVVALGTIQAGCIFSSVNPIYKEVEICHQVSITEPKLVVTIPECYDTVVKGLKMAKSPAKIVLIDSPNKPVPEGTIRYTEIAETDGVDYALLDGVEKNLEDVALIPFSSGTTGLPKGVEITYKNLIAALEVMAKEENCFPILTNGNQQDVVPCILPFFHIYGMVVTLLGHFVKGCKLITLPRFSANTYFDVLKNQNPTILYVVPPVAILLGKHPEVTKDHLKHVKYMVCGAAPLSASDANAVLEKSNVFLNDKNYTGNFQGKLEFKQGYGATETTSLTTSTLIGATDIDYSSCGMPLAHTEIMFLGSDGKPVPIGEPGELCTRSPTVMKGYYKNEKATKESMTDDGFFKTGDLGHYDPKYGLYVTDRIKELIKVKGMQVAPAELEGLLRSHPAVADAAVIGVPHEYFGEAPKAFIIRKGGQNTSPEELQDFIANKVASFKKIEEVVFVDDIPKTTSGKILRKELKKMYA-