Monarch geneset OGS2.0

DPOGS203804
TranscriptDPOGS203804-TA2073 bp
ProteinDPOGS203804-PA690 aa
Genomic positionDPSCF300010 + 1725305-1734110
RNAseq coverage1646x (Rank: top 8%)
Annotation
HeliconiusHMEL0133040.072.32% 
BombyxBGIBMGA003709-TA0.079.28% 
Drosophilabgm-PA0.049.56% 
EBI UniRef50UniRef50_E2A0J90.051.40%Long-chain-fatty-acid--CoA ligase ACSBG2 n=8 Tax=Endopterygota RepID=E2A0J9_CAMFO
NCBI RefSeqXP_967873.10.055.36%PREDICTED: similar to AGAP008596-PA [Tribolium castaneum]
NCBI nr blastpgi|910760840.055.36%PREDICTED: similar to AGAP008596-PA [Tribolium castaneum]
NCBI nr blastxgi|910760840.055.36%PREDICTED: similar to AGAP008596-PA [Tribolium castaneum]
Group
Gene OntologyGO:00081521.6e-95metabolic process
GO:00038241.6e-95catalytic activity
KEGG pathwaymbr:MONBRDRAFT_385831e-165 
 K01897 (ACSL, fadD)maps-> Peroxisome
    Fatty acid metabolism
    Adipocytokine signaling pathway
    PPAR signaling pathway
InterPro domain[81-557] IPR0008731.6e-95AMP-dependent synthetase/ligase
Orthology groupMCL11303 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS203804-TA
ATGCAGGTCCCAAATGGAGTTGAAAATCAGAAATATTTGAATGGTCCTGACCAAGTAATACCAGTGGATCACTACATATGTTGTACTCCGGGCGGACACGTGAAACTGCGTATGGGCTCGCGGGGAGTCGCTGCAGAGCCTCCGATATCCGTACCAGGTCTACTGAGCAGGACAGTGGCTAGATATCCGGATGCAACGGCCTTTGCCACAAAAAAATCTGATGGAAAATGGCATAGAACCACTTACAGGCAGTACCAGGAACGCGTAAGGACAATAGCAAAGGCCTTTTTGAAAATAGGTCTCGAACGTTTTCACTCCGTATGTATATTGGGATTCAATTCAGAAGAATGGTACATCGCCGACCTCGCAGCTATTCACGCCGGAGGTTATGCGGCTGGAATTTACACCACGAATTCTGCTGAAGCATGTTTCCATTGTCTGGAATCCTCACGAGCGAACATCTGCGCTGTGCAGGACAAGAAGCAATTAGACAAAATATTGTCCATACAAGGACGCTTGAAGCATCTAAAAGCTATCGTACAATGGGAAGGTCCGGTAGACACGTCTGTGCCTGGTGTCTACAGTTGGGATCAGTTGATGGAGATGGGAGCGAAGGAACCTGACACACAGTTGAACAACATCCTTAAGTCTATTGCCGTCAACGAGTGCTGCACTCTTGTATACACTTCCGGAACCGTGGGTCCACCGAAAGCGGTGATGCTCTCACACGACAATCTCACGTGGGATGCATTCGCGATCAGTGAAAGATGCGGTGATTTGAAACCTACATTAGACAAAATCGTTTCATACTTACCGCTGAGTCACGTTGCCGCACAGGTAGTCGACATTTATGCAACATTAACAAATGCTATTGAAGTATACTTCGCAAGACCTGATGCCCTCAAGGGGACGTTGGTTGAAACCTTGAGAGAAGTCCGTCCAACGAGGTTTCTAGCTGTGCCGCGAGTGTGGGAAAAGATGCACGAAAAAATAATGGCTGTAGGCGCTGCGAACAGCTCCTTCAAGAAGAGCATAGCAATGTGGGCCAAAGAAAAAGGAACGAAACACCACCTAGCAAGAATTAACGGTGCCCTTTCGTCCAGTCCGAGGGTTCTGCCAATAGACTTCAGTCTTTTGGAGTTACTTAAGTATTGGCAAACGGGTACAACTTGTGGTTATAAACTGGCGAAGTCGCTAATCTTCAGTAAGGTGCGTGACTCTCTCGGACTTGACCGTTGCCTAACATTCGTAACTGCGGCGGCGCCACTGTCGCCTGAGATCAAAAAGTTTTTCCTTTCCTTGGACATACCCATCATGGATGCCTTCGGGATGAGTGAAGCAGCTGGTGCTCACACACTTAGTATATATCCAAAATTCTCGCTGGATTCATCTGGAGAAATACTGGACGGAACCGAAACACGATTTGGGGACTCCATGAGTGTCAACGGCCCTGGTGAAATTCAAATGCGCGGTCGCCATGTTCTCATGGGCTACCTCAATGACGAAGAAAAGACGAAAGCCACTTTAGACGAAGACGGATGGTTACACTCTGGCGATGTCGGAAGATTGGATAGTCACAACTTGTTGTACATCACAGGAAGAATAAAGGAATTATTGATAACTGCTGGAGGAGAAAACATAGCTCCTGTTCTCATAGAGCAGGCGGTTCAATCCGAACTGCTACACGTAGGATATGCTGTGCTTGTGGGAGACAGGAAGAAGTTCCTAGCCATATTACTTACTTTAAAGGCCAAAGTGGATTCAAACACCGGTGACGCGTTGGACGAGTTAGACACTGAAACAAAGAAGTGGGTGGCGGGTCTTGGTAGCTCTGCCACCACCATCAGTGAAATCGTCAGGACCAAGGATCCAGTTGTGTATAAAGCTATAGAAGATGGAATCACTCGTGCTAACAAACACGCAATATCGAACGCTCAGAAAGTACAAAAATTCGCTATACTACCCGCAGACTTCTCAATGAACACCGGAGAACTAGGACCAACATTGAAAATCAAGAGGAACGTAGTATACGAGAAATACAAAGACATCATAGAAGACTTCTACAAAGATTAG

Protein sequence:

>DPOGS203804-PA
MQVPNGVENQKYLNGPDQVIPVDHYICCTPGGHVKLRMGSRGVAAEPPISVPGLLSRTVARYPDATAFATKKSDGKWHRTTYRQYQERVRTIAKAFLKIGLERFHSVCILGFNSEEWYIADLAAIHAGGYAAGIYTTNSAEACFHCLESSRANICAVQDKKQLDKILSIQGRLKHLKAIVQWEGPVDTSVPGVYSWDQLMEMGAKEPDTQLNNILKSIAVNECCTLVYTSGTVGPPKAVMLSHDNLTWDAFAISERCGDLKPTLDKIVSYLPLSHVAAQVVDIYATLTNAIEVYFARPDALKGTLVETLREVRPTRFLAVPRVWEKMHEKIMAVGAANSSFKKSIAMWAKEKGTKHHLARINGALSSSPRVLPIDFSLLELLKYWQTGTTCGYKLAKSLIFSKVRDSLGLDRCLTFVTAAAPLSPEIKKFFLSLDIPIMDAFGMSEAAGAHTLSIYPKFSLDSSGEILDGTETRFGDSMSVNGPGEIQMRGRHVLMGYLNDEEKTKATLDEDGWLHSGDVGRLDSHNLLYITGRIKELLITAGGENIAPVLIEQAVQSELLHVGYAVLVGDRKKFLAILLTLKAKVDSNTGDALDELDTETKKWVAGLGSSATTISEIVRTKDPVVYKAIEDGITRANKHAISNAQKVQKFAILPADFSMNTGELGPTLKIKRNVVYEKYKDIIEDFYKD-