Monarch geneset OGS2.0

DPOGS211171
TranscriptDPOGS211171-TA1134 bp
ProteinDPOGS211171-PA377 aa
Genomic positionDPSCF300007 + 314562-315721
RNAseq coverage102x (Rank: top 61%)
Annotation
HeliconiusHMEL0172289e-15065.25% 
BombyxBGIBMGA003156-TA9e-13156.15% 
DrosophilaCG9319-PA4e-10750.92% 
EBI UniRef50UniRef50_Q9VIK06e-10550.92%CG9319 n=12 Tax=Eumetazoa RepID=Q9VIK0_DROME
NCBI RefSeqXP_001974119.13e-10850.92%GG21551 [Drosophila erecta]
NCBI nr blastpgi|3407117971e-10852.82%PREDICTED: alpha-methylacyl-CoA racemase-like [Bombus terrestris]
NCBI nr blastxgi|3407117974e-10652.82%PREDICTED: alpha-methylacyl-CoA racemase-like [Bombus terrestris]
Group
Gene OntologyGO:00081521.1e-129metabolic process
GO:00038241.1e-129catalytic activity
KEGG pathwayder:Dere_GG215518e-108 
 K01796 (E5.1.99.4, AMACR, mcr)maps-> Peroxisome
    Primary bile acid biosynthesis
InterPro domain[1-364] IPR0036731.1e-129CoA-transferase family III
[1-369] IPR0236069.9e-94CoA-transferase family III domain
Orthology groupMCL11976 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS211171-TA
ATGGCTTTAAATGGCATTAAAGTTGTGGAAATGCTGGGATTGGCTCCCGGCCCTTTTTGTGGAACTGTATTAGCTGATTTTGGAGCAACTGTCACAGTTGTACAAAAGATCGAAGTCACAACGTTGGATGTTTTGTCGAATGGGAAGAAAGCCATTTCTTTGAACTTGAAATGTCCAAGAGGTAAACAAGTATTTAAAAAATTAATGGAATCTTCAGATGTTTTAATTGATACATATCGACCAGGTGTTTTAGAAAAACTGGGTCTCGGCCCACAAGACCTAATGAAAAAAAATCCAAAATTAATATATGCAAGATTAACAGGATATGGGCAAACTGGATTTTACAAGGATAAAGCTGGTCATGATTTGAATTATGTTGCCATGTCAGGTGTCCTTTCAATGCTCGGCAAGGATGGACAACCACCTAGAGCTCCAATAAATATACTTGCAGATTTAGCTGGTGGTAGCCTGTCTTGTGTTCTTGGTATAATTTTGGCATTGTTTGAGAGAAGTAACTCAGGAAAGGGACAAATTATTGATGCAAGCATGACAGAAGGAGCTGCATATGTTGCAAATTGGATTTTTAAATCAAAAAATTTGCCAATTTGGATGGGGGAACCAGGAACTAGCATTTTGGATGGTGGATACCCCAGTTACCAAACTTATAAGACCAAGGATAACAAATTCATAGCAGTAGCCGCTTTGGAAGAAAAGTTCCATTTAATGTTCCTAAAAGGACTTAACATTTCAGAAGAAGACTACGCAGTGTGGGAAAAAAATGAATGTGAAAAGAAATTTAAAGAAATATTTTTAACTAAGACGCAACAGGAATGGTGCAACATCTTTAAAGAATTGGATGCCTGCGTCACTCCTGTATTAGATTTAGAAAATGTAAAAAACCATGACCTGCATCTCTCACGAAAGTCATTCTATGTAGATGAAAATAATTTAGTAGCTCCAGAACCAGCTCCTCGATTATCAAGAACACCAGGTTCAGCCAGTGGTAAGTTACCATCGGTGAAACCAGGTCAACATACAATAGAAATACTGACATCATTAGGGTATAAAAAATCAGAAATACAAGAACTTATTAACAACAATAGTGTCTATGCCTATAAAAAATCGAATTTATAA

Protein sequence:

>DPOGS211171-PA
MALNGIKVVEMLGLAPGPFCGTVLADFGATVTVVQKIEVTTLDVLSNGKKAISLNLKCPRGKQVFKKLMESSDVLIDTYRPGVLEKLGLGPQDLMKKNPKLIYARLTGYGQTGFYKDKAGHDLNYVAMSGVLSMLGKDGQPPRAPINILADLAGGSLSCVLGIILALFERSNSGKGQIIDASMTEGAAYVANWIFKSKNLPIWMGEPGTSILDGGYPSYQTYKTKDNKFIAVAALEEKFHLMFLKGLNISEEDYAVWEKNECEKKFKEIFLTKTQQEWCNIFKELDACVTPVLDLENVKNHDLHLSRKSFYVDENNLVAPEPAPRLSRTPGSASGKLPSVKPGQHTIEILTSLGYKKSEIQELINNNSVYAYKKSNL-