Monarch geneset OGS2.0

DPOGS214141
TranscriptDPOGS214141-TA1926 bp
ProteinDPOGS214141-PA641 aa
Genomic positionDPSCF300014 - 1203384-1214327
RNAseq coverage2542x (Rank: top 5%)
Annotation
HeliconiusHMEL0128320.075.89% 
BombyxBGIBMGA006189-TA0.077.21% 
DrosophilaCG30194-PD7e-16144.29% 
EBI UniRef50UniRef50_UPI00022476020.054.62%UPI0002247602 related cluster n=2 Tax=unknown RepID=UPI0002247602
NCBI RefSeqXP_001603871.10.054.15%PREDICTED: similar to ENSANGP00000012858 [Nasonia vitripennis]
NCBI nr blastpgi|3454936030.054.62%PREDICTED: long-chain fatty acid transport protein 4-like isoform 2 [Nasonia vitripennis]
NCBI nr blastxgi|3454936030.054.62%PREDICTED: long-chain fatty acid transport protein 4-like isoform 2 [Nasonia vitripennis]
Group
Gene OntologyGO:00081522.6e-66metabolic process
GO:00038242.6e-66catalytic activity
KEGG pathwaynvi:1001202110.0 
 K08745 (SLC27A1_4, FATP1, FATP4)maps-> PPAR signaling pathway
InterPro domain[107-538] IPR0008732.6e-66AMP-dependent synthetase/ligase
Orthology groupMCL10110 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS214141-TA
ATGGACGCACTTTTGGCAGCCCTCGTTGCCCTTATGGCATTAGCCGCCGCGATGGCCGCCGTGCTAAGCACGCTTTCTAAAGCGGCGATTTTCGCGATCCTTGCGATTGCTCCCTGCGTATATCGATACAGAAAAAGGATTTATGTCATTGTGAAAACTTTGCCCAGGGATGCCAAATTCTTGTGGCGGTATGCAAACGCTATGGTGCGGTCGAAGCGCTGGGGACGTAACAACAGCACGGTGGCGGAACTGTTTACCAAAAGGGCCCTAAAGACGCCGGACGCTCCATGTTTCATCATGGTCGAAGGACGGACTTGGACTTTTCGAGAGATAGCAGAAAATTCAAACCAAGTGTCTCGGGTGATGCAGGAACATCTAGGTTTAAAACGTGGTGATGTCGTGTGTGTGTTTATGCCCAACTGCGTTGAGTATGTTTACACGTGGCTGGGTATGGCGAAGCTCGGCGCCGTGTCCGCGCTTATCAACAGCAACCTTCGCCATCGTCCGCTGCTGCATTGTATCCAAGTGGCTAAAGCCAAGGCGATCGTATTCTCAGATTCGCTGGCTGGAGCTATATCGGAGTTGGGGGATCAGCTGCCTCCTGAGCTGAAGCTATTTCAGCTGTACGGCAAGTGCCCCCCTGGGGTTATTGACCTTCGGGCTGAGATGGATAAACAGGTTCCAGAATATCCTATAGTGACAGACAAGCCTCGATACAGAGATACGCTTCTGTATATTTACACGTCAGGCACGACCGGTATGCCAAAGGCGGCGGTATTGCCAAACTCTAAGTATCTGCTGATAGTAGTGGCGACGGTCCACATGCTCGGCCTCCGCTCGTCCGACCGCCTGTACAACCCTCTGCCGCTGTACCACCTGGCTGGCGGGCTGGTGGGGACCTGCGCCGCGCTCGTGGACGGCATCCCCACCGTACTGCGGTCCAAATTCTCCGCTACCCACTACTGGACAGACTGCATCAAGTATGATTGTACGGTGTCTCAGTACATAGGCGAGATGTGTCGTTACCTGCTGTGTGCTCCGTCCAGACCCACGGACACTCAACACCGCGTCCGCATCATGGTAGGGAACGGCATGAGGCCGGCCATCTGGCAGCAGATCGTTGACAGGTTCAAAGTACCTCAGATAAATGAAATATACGGCGCGACGGAGGGAAATGCAAACATAATCAACGTGGACAATACAGTGGGCGCTGTGGGCTTCCTGCCCAAGTTGGTGCCCACGTGGTTACACCCCATAGCACTGGTCCGAGCTGATGATGACGGGGACCTGATCCGCGGCCCGGACGGATTGTGCATCAGATGCCAACCCAACGAGCCCGGTATGTTCATCGGCCTCATCGCTCAGGGCAACGCGTCCAGGGAGTACTACGGATACGTTGAAAAGAGCGACAGTAACAAGAAGCTGGTCCGCGACGTGTTCTGTAAGGGAGACGCGGCTTTCGTCAGCGGAGATATTCTGGTGGCGGACGAGCTCGGGTACCTGTACTTCAGAGACAGGACCGGGGACACTTACAAGTGGAAGGGGGAGAACGTCGCCACGGCCGAGGTGGAAGACGCGGTGAGGGCGGCCATCGGACAGAGAGATGTCGTCGTCTACGGAGTCTCGATCCCTCAAACGGAAGGCCGCGCTGGTATGGCCGCTGTGAGTGCAGCCAGCGTAGACGGACGGTCGCTGGCGGTAGCGCTCGACCACGCGCTACCATCATACGCGAGACCGCTGTTCCTCAGACTCATGAAGGACATAGAGATCACCAGTACGTTCAAGTTAAAGAAGCGGCAGTATCAGAAGGAAGGTTTCGACCCGGACGTGATCCAGGACCCGCTGTTCTTCCGCTCCGGGGACGACTTCGTCCCTCTGACGTCACAACTGTTCGACGACATCTGCAACGGACGAGTCAAACTATAA

Protein sequence:

>DPOGS214141-PA
MDALLAALVALMALAAAMAAVLSTLSKAAIFAILAIAPCVYRYRKRIYVIVKTLPRDAKFLWRYANAMVRSKRWGRNNSTVAELFTKRALKTPDAPCFIMVEGRTWTFREIAENSNQVSRVMQEHLGLKRGDVVCVFMPNCVEYVYTWLGMAKLGAVSALINSNLRHRPLLHCIQVAKAKAIVFSDSLAGAISELGDQLPPELKLFQLYGKCPPGVIDLRAEMDKQVPEYPIVTDKPRYRDTLLYIYTSGTTGMPKAAVLPNSKYLLIVVATVHMLGLRSSDRLYNPLPLYHLAGGLVGTCAALVDGIPTVLRSKFSATHYWTDCIKYDCTVSQYIGEMCRYLLCAPSRPTDTQHRVRIMVGNGMRPAIWQQIVDRFKVPQINEIYGATEGNANIINVDNTVGAVGFLPKLVPTWLHPIALVRADDDGDLIRGPDGLCIRCQPNEPGMFIGLIAQGNASREYYGYVEKSDSNKKLVRDVFCKGDAAFVSGDILVADELGYLYFRDRTGDTYKWKGENVATAEVEDAVRAAIGQRDVVVYGVSIPQTEGRAGMAAVSAASVDGRSLAVALDHALPSYARPLFLRLMKDIEITSTFKLKKRQYQKEGFDPDVIQDPLFFRSGDDFVPLTSQLFDDICNGRVKL-