Monarch geneset OGS2.0

DPOGS207082
TranscriptDPOGS207082-TA7263 bp
ProteinDPOGS207082-PA2420 aa
Genomic positionDPSCF300001 + 2633009-2648396
RNAseq coverage1568x (Rank: top 8%)
Annotation
HeliconiusHMEL0041440.087.56% 
BombyxBGIBMGA013047-TA0.075.10% 
DrosophilaCG17374-PC0.047.09% 
EBI UniRef50UniRef50_O016780.078.88%P270 n=18 Tax=Endopterygota RepID=O01678_BOMMO
NCBI RefSeqNP_001037478.10.078.88%p270 [Bombyx mori]
NCBI nr blastpgi|1129843400.078.88%p270 [Bombyx mori]
NCBI nr blastxgi|1129843400.078.88%p270 [Bombyx mori]
Group
Gene OntologyGO:00038240catalytic activity
GO:00090580biosynthetic process
GO:00167473.8e-134transferase activity, transferring acyl groups other than amino-acyl groups
GO:00164913.8e-134oxidoreductase activity
GO:00081524.7e-72metabolic process
GO:00167408.3e-47transferase activity
GO:00000368.2e-32acyl carrier activity
GO:00054883.1e-24binding
GO:00551143.7e-22oxidation-reduction process
GO:00082703.7e-22zinc ion binding
GO:00167881.7e-09hydrolase activity, acting on ester bonds
GO:00480378.8e-06cofactor binding
KEGG pathwayphu:Phum_PHUM4483900.0 
 K00665 (FASN)maps-> Insulin signaling pathway
    Fatty acid biosynthesis
InterPro domain[24-2421] IPR0007940Beta-ketoacyl synthase
[1430-1746] IPR0208433.8e-134Polyketide synthase, enoylreductase
[26-431] IPR0208412.6e-99Polyketide synthase, beta-ketoacyl synthase domain
[25-264] IPR0160384.7e-72Thiolase-like, subgroup
[25-260] IPR0140301.9e-67Beta-ketoacyl synthase, N-terminal
[521-835] IPR0140432.1e-66Acyl transferase
[16-431] IPR0160391e-63Thiolase-like
[707-822] IPR0012278.3e-47Acyl transferase domain
[1778-1957] IPR0139682e-42Polyketide synthase, KR
[519-805] IPR0160358.2e-39Acyl transferase/acyl hydrolase/lysophospholipase
[265-381] IPR0140312.6e-34Beta-ketoacyl synthase, C-terminal
[523-817] IPR0208015.7e-34Polyketide synthase, acyl transferase domain
[2012-2091] IPR0090818.2e-32Acyl carrier protein-like
[1771-1979] IPR0160403.1e-24NAD(P)-binding domain
[1777-1958] IPR0208423.9e-24Polyketide synthase/Fatty acid synthase, KR
[1571-1702] IPR0131493.7e-22Alcohol dehydrogenase, C-terminal
[1424-1567] IPR0110329.8e-21GroES-like
[644-706] IPR0160361.3e-11Malonyl-CoA ACP transacylase, ACP-binding
[2146-2393] IPR0010311.7e-09Thioesterase
[2017-2062] IPR0061638.8e-06Phosphopantetheine-binding
Orthology groupMCL10042 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS207082-TA
ATGGCACCATCTCCAGTGTCGGAAGATAGATTGACGTCGGGACACCGTCTGTCTCACCCTCCGCCCGGAGATGAGGTCCTCATCACTGGCATCTCGGGATACTTTCCCGACTCTGACTCTGTTTTGCATCTGCAAGAGAACCTCTTCAATAAGGTGGACCTGATCTCTGGAGACGCCAGACGATGGAAGCTCGCACACCCTGAGATTCCGCAGCGTACCGGCAAGATCAATAACGTCAATAGATTCGACGCTTCATTCTTTGGGGTTCATTTCAAGCAGGCCCACACTATGGATCCCATGTGCAGGATCCTTTTGGAAAAAGCTTATGAGGCAGTCATAGATGCAGGCTTAAACCCAAAAGAGCTGCGTGGCACCAAGACCGGTGTATTCATTGGAGCATGTTTCTCGGAGTCTGAAAAGACCTGGTTCTATGAGAAAATGCAGGTCAACGGTTTTGGAGTGACTGGATGTTCCCGTGCGATGTTGGCCAACCGTATCTCATATTGGCTTGGCGTCACTGGTCCATCTTACACTGTCGACTCAGCCTGTTCTAGTTCGCTCTACGCTTTGGAACATGCGTTTAGGGCCATAAGAGATGGACATTGCGATGCTGCGATCGTCGGAGGATCGAATTTGTGCTTACATCCTTACGTTTCCCTTCAGTTTTCGAGGTTGGGCGTGTTATCAGCTGACGGTAGATGCAAAAGTTTTGATAACAGTGCCAACGGTTATGCCCGATCTGAAGCCATCGTCGTATGTCTATTACAGAAAGCAAAAGATTCTAGGAGAGTTTATGCTCAAGTATTACACGCGAAAACAAACTGTGATGGTTACAAGGAACAGGGTATCACATATCCAGCTGGAAACATCCAGAAGCTGCTACTCAACGAGTTCTACGAAGAATGTTCTATCCCTCCAAACACCCTCGAGTTCGTGGAGGCTCATGGCACTGGTACACGAGTAGGAGACCCTGAAGAGTTGGTTGCAATTGAAGATGTTTTCTGCACTGGTCGTTCTGAGCCACTTATGATTGGATCTATTAAGTCAAATTTGGGTCACAGTGAGCCAGCATCTGGCTTATGTTCAATTGCTAAGCTCTGTATTGCTTATTCCACTGGATACATTCCACCCAATCTACACTTTACGGTTCCACGTGAAGGAGTAAAAGCTCTGTCCGAAAATCGCCTGAAAGTCATCACCCAGAAACATCCTTGGAATAGAGGCATGTCTGGTATCAACAGTTTTGGATTTGGAGGAGCAAATGCGCACACCTTACTGAAGAATGTTGCTAGAGAAAAGGTCAACAATGGTATACCATCTGATGATTTGCCGCGTTTAGTGTGTGTCTCTGGTCGTACTGAGTCCGCAGTCGCGAAAATACTGGATGATCTAGAGTCAAGGACAGTAGATGCCGAACTGATAAGGTTGTTGCATGCCATCCATGACGACGATATCACAGGCCATGTAGTGAGAGGATACAGTTTGTTAGAATCAACTCCAGTGAAGAGCAAATCTGTCAGTCGAGACATTCAGTACTTCTCAGGTGTGAAGCGCCCTGTTTGGTTTGTGTATTCTGGAATGGGCTCCCAATGGGCTGGTATGGCGACGCAGCTTATGAGAATTCCTGTTTTCGCAGCTGCTATTAACAAATGCCACAAAGCTTTGGAACCTAAAGGAGTCAATCTAATAAAAACCATAACGGACCCTGATCCATCTATTTATGATAATATCTTGAACTCATTCATTGGTATCGCCGCTGTACAAATCGGTCTTACTGATGTCTTGAAAGCTGTAGGAATTGAACCCGACTTCATCATTGGTCACAGTGTTGGTGAGCTGGGATGTGCCTATGCAGATGGTTGTTTCACCGCGGAACAGATGATCCTCTCAGCGTATAGTCGTGGACTCGCCTCCATCGAAACCCCATTCATCAAGGGGTCCATGGCTGCTGTGGGCTTGGGTTATGCTCAGATAAAATCAATAATTCCACCAGAGATCGAAGTTGCATGCCACAATGGTCCGGACTCGAGCACGATATCCGGACCTGCGGAAATCATGAAGAGCTTTGTTGCAAAGCTTGTAGCTCAAGGGGTATTCGCTAAGGAAGTCCCTTGTTCTAATATCGCATACCATTCAAAATATATTGCAGATGCAGGTCCGAAACTATTACAATACCTGCGAGAAGTAATACCTCAGCCTAAGTCTCGATCCGAGCGTTGGGTTTCGACATCTGTACCGCAGATGCTATGGAAAGACCCCAGAGCAACTCTGTCATCCGCGGAATATCACACTAATAACCTCTTGAGTCCAGTGCTATTCGAAGAGACAGCTCGTCTGATACCTGGAAATGCAATAACAATTGAAATTGCACCCCATGGTCTGTTGCAAGCTATTCTTCGTCGTTCTCTAAAGAAAGACGTACTTAACATTCCTCTTACTGAGAAGAAGCACGCCGACAATGTTCAATACTTTTTAACGGCTTTGGGCAAACTTTACGAGGCGGGTTTGAACCCACATCTCGCGAACATCTACCCGCATGTTCCATTCCCCGTGTCTCAGGGAACACCCATGTTGTCCCACCTCGTCGAGTGGGAACATAGTGAAGATTGGTACGTAACATCATACAAAGCCCAAGATAAAATGAAGTCTGGTGAAAGGACAGTCAGGATGTCCATTGTTGATGAGGACTCTGAATTTATGGCTGGGCACGTGATCGATGGTCGTAATCTATATCCTGCGACTGGTTATCTGGTCATGGTTTGGGAGACACTTGGTATGATGTTGGGAGAATTATATACTGAGGTGTCGGTTGTCTTCGAGAACGTCCGATTCCAAAGAGCTACTAACATTCCTAAGGACGGAAATCTCGAGTTCATAGTTATGATCCAGAAGGGCAGTGGACAATTTGAGATCGTAGAAAGTGGTGCTTCAATCGTGACTGGTCGTATCTACAGCAAAAAGAATGTAGGACAGGACTTCCGTGTTTTACCTGCGTCACCTGAATCATCTGGACCCAATGTGAAACACTTGTTGACCAAGGACTTTTATAAAGAATTGCGTCTCAGAGGCTATCAGTATAGTGGACTCTTCCGAGGTATCATTGGCTGCAACGTCGAGGGTACTCGTGGTCGACTTGCTTGGCAGAACAACTGGGTGGCTTTCATGGACTGTATGCTCCAGCTCAAGCTGATTGGCCAGGATACAAGAGGACTTTTTGTACCAACAAGAATAGAAAGATTGAGTATTGATGCAAACATGCACTATGATGCGATATCGAAAATGAATCCTGATTCCGCACGCCAGTCATTTGAAATTAGGGTGTATCCGGAGGTTGAAGTGATAAGGGCTGGTGGTGTGGAAATTCGAGGTCTTCATGCAACACCAATATCAAGGAAGAATCCTCTTGGTGTTCCAGTACTAGAAAAGAATGTATTTGTACCCAATTTCGGCAAGTCCAAGATGAAGATTGAAGATATTCTACGTGCTAACATTCAACTGGTTCTTGAAAATATACAAACTTATAAAGTTAAGGCCATAGAAGTTATTGATGAAGAGTACAAAACTAATAACTACGAACCAATTATGGAAATCGTTGGTGAGGTACTCGGTGACTTGCCACTGATTCAAGCAGATATGACAGTATTCTCTGAGGAGCCTATAGAAATGCCATCTAACATTGTGGTAGAAAACAAGAAAGCTACAGCTGAATCGAATACTATTATATATATCGGAGCTAATTTATTAAGGCGCCCTGAAGTTCTTAAACCGGCACTTAGCACACTCCGAGACAAAGGTTTTCTGATTAGTCGTGAAAAAGACCTTATCAATCCAAAAGATTTTTCTGATAAATTTGATATAATAAGTATACAAGATACTGGAGTCGAATATTTGGTTCTATTCCGAAAACGAATTGGTGCCAAGCCCGCAAAATTTATTAAAATTGTAACTACCGATGATACCTATTCTTGGATAGATAAAGTGAAAGAAGAATTAAAGGTCGGTCAGAAGCTTGTTCTTTACAATGAGGATGAGCCTATTAATGGTCTGTTAGGATTAGTCAATTGTCTCCGAAGGGAACCAGGTGGCGAAGTCGTCCATGGTCTTCTCATTGCTGATCCTTCAGCTCCAGCATTTAACCCAGATTTGGAATTTTACGAGGAGCAGCTAGACAAAGATATGGCTTTAAATGTTTATCAAGATGGTCAGTGGGGCACATACCGTCATTTACTCCTAGGCGATCTGGATGTAGTGAATGCTCACCACGCATTCGTTAACACTACAACTATCGGTGATTTGTCCACCCTTAAGTGGCTTGAAGGACCAATCAGAGAGAACACAGTGTTTAAGGATCCATACAGCATTCTCATACATGTATATTGTGCTGCATTGAACTTCCGTGATGTCATGACAGCCATGGGTAGAGTGACAGTGGACGCTGTAGCGAGAGGGAGGTTGGCGCAAGAATGTGTACAGGGCTTTGAAGTCGTGGGAAGAACATGCAATGGATCAAGAGTTATGGCCATGGTAAGAAACAGCGGTATGGCAAACATGATAGAAGGAGACAAGGCTCTAATGTGGAATATTCCGGATGAATGGACCTTCGAAGAAGCGGCTTCTGTGCCTGTTGCGTACGGAACTGTGTATTATGCTATGGTAATGGTTGCCCGGGTGCAACGCGGAGAGTCAGTTTTGATTCACGCTGGGTCTGGTGGTGTAGGTCAAGCAGCTATTAACGTGGCCCTTCATTATGGATGTGAAGTTTTTACTACTGTGGGAACTCCAGAAAAAAGGGCATTCATCAAGAAGCTTTTCCCGCAGCTTAAAGATAGTCATATTGGCAATTCCCGTGACACATCCTTCGAAGACATGATCCGAAAAGAAACGAAAGGCAAGGGTGTCGACATCGTACTCAACTCTTTAGCAGATGAGAAGTTACAGGCGTCAGTCCGTTGCCTCGGGTACCGTGGTCGGTTCCTTGAGATCGGCAAGTTTGATATCAGTAACAATACTCCCATCGGCATGTACTTCTTTTTGAAGGAGACTTCGTTCCACGGCATTATGTTGGACTTTATCTTCGATCACAGTTATGAGTTCAGAAAGAGTCTACAAGATTTGCTGCTCAGTGGCATTGAAAGCGGTGCTGTACGCCCACTAACATACTGTACTTTTGAAAAGCACGAGATAGAGGCTGCATTCAGATATATGGCTGCTGGAAAACATATTGGAAAGGTTATTATTAAAATTCGAGATGAAGAAAGGTCACATCGCCCTGTGAAGCCAGTTACATTAAGAATACCAGCAACACCTCGTTACATGTGTATGGACGATTACGTCTACGTAGTTGTTGGTGGTCTTGGAGGTTTCGGTCTGGAGTTAACGGACTGGCTAATACTGCGCGGTGCCAGGCGTGTATTGCTCACCTCTAGGCGAGGAGTCAGTAATGGATACCAGGCCTCTAGATTAAGGACTTGGGCTCAATATAAGGCAGATGTGCAAGTTTCAACTCACGATATTACAACTGAAGAAGGTTGTCAAAAGATGTTGGAAATGGCGAACTCGATGGGTAAAGTCGAGGCTATATTTAATTTGGCTGTGATCTTGAAGGATTCCATCTTCCAAAATCAAACTCCTGAAACATTCAAAATATCTTTCGGTCCAAAAGCTAAGGCTACTATGAATTTGGATAAACTGTCTAGGAAACTATGTCCTGGTCTTAAGGACTTTGTAATATTCTCATCCGTATCCTGCGGTCGTGGTAATGCCGGGCAAACCAACTATGGATACTCAAACTCAGTTATGGAACGAATCTGCGAGTGGCGACACAAGCTTGGTCTTCCTGCGCTGGCTGTGCAGTGGGGCGCTGTGGGAGATGTCGGTTTGGTAGCGGATATGCAAGACGAAGATGTTCAACTAGAAATCGGTGGTACCTTGCAGCAGAGAATTTCCTCCTGCCTTCTATCTTTAGACAAATTCTTAAAACAAGATGCAGTTATAGTGTCCTCGATAGTTGTGGCTGAGAAGAAGGCCGGTGGTTCTGGTTGTGGCAACATTGTCGACGCTGTCGCACAAATTATGGGTATCAAGGATCTGAAGACTGTATCTCAACAAGTCTCCCTCGCTGAATTGGGTATGGACAGTATGATGGCTGTTGAGATTAAACAAACACTGGAAAGAGAATTTGAGATATTCCTTACAGCGCAAGACATCAGAACATTGACATTTGCACGATTAGTGGAACTGACGGCAATACGAGAGGCAGCAGCATCCACTTCGGCAACACCTAGATCTAATTCATCTGACATAACAGCTGGTCTGCGAACCTTTATAAGGAATTTCGGAAACGAAAACCTAGCCACAGAACCTTTCATTTATATGCCCACTATGGTCAGCGATGGAACAGAGACGGATGTTTCAGTTCACGAAGACGAAGCGGTTATGTTCATGCTACCAGGTCTGGAAGGTTGTGCTGCGGGAATGGCCCCTCTTTGTAAACGCCTTAAGATTAAGATTTGTGCTCTGCAATTCGGTGCTGAAAGTGGCAATGACACTTTAGATGATTTGGTTAATAGTCTACATCAGAAAATAACATCACGTTTAATCCATGGAAAACCTTACATCCTCCTTGGATACAGTTTTGGAACTCTTCCCTTGTTTAAGTTAGCAAGCATTCTAGAAAGCGAAGGACACTCTGGAACAGTGTTTTGTATAGACGGCAGTCCTGAATTCCTCTCCACCACGATTACGACCATTGCAGAATTCAGAAATGACAAAATATTACAGAACAGTCTGATTTGTTACACCATAGATTTGGTTGCTCCTAATAATGATGTAACAAAGAATTTGATGGAAAAACTTCACGAAATTGAATCGTATGATGAGAGGATTGACTATGGTATAAAAATTTGTCCAGTTCAACACAGCTATTCTCACAACTTTATACGCGCCATGTCGAAAGCTTGTTACAATAGAACTAAAATGATTTTTGAACATAACGAAAATGATGTTAAGAAAATTAATGCGCCTGTCATACTATTACGACCCAAAGAAATTCCCTTACCTCTTGAGGACAATTATGGTTTGGACAAATATACTGAAGGCCCTGTTACTGTACATTATTTGGAAGGAAACCATGTAACTATAATTGAAAATAAGGATTGTGCAAATATCATAAATAAAATTATAGGGGGTCAAGATAAAGTAGAAAAGGATGCGCCAAACGTGGTTACAAATATGATAGAACAACAAAGATCAGTTCAAGTTTAG

Protein sequence:

>DPOGS207082-PA
MAPSPVSEDRLTSGHRLSHPPPGDEVLITGISGYFPDSDSVLHLQENLFNKVDLISGDARRWKLAHPEIPQRTGKINNVNRFDASFFGVHFKQAHTMDPMCRILLEKAYEAVIDAGLNPKELRGTKTGVFIGACFSESEKTWFYEKMQVNGFGVTGCSRAMLANRISYWLGVTGPSYTVDSACSSSLYALEHAFRAIRDGHCDAAIVGGSNLCLHPYVSLQFSRLGVLSADGRCKSFDNSANGYARSEAIVVCLLQKAKDSRRVYAQVLHAKTNCDGYKEQGITYPAGNIQKLLLNEFYEECSIPPNTLEFVEAHGTGTRVGDPEELVAIEDVFCTGRSEPLMIGSIKSNLGHSEPASGLCSIAKLCIAYSTGYIPPNLHFTVPREGVKALSENRLKVITQKHPWNRGMSGINSFGFGGANAHTLLKNVAREKVNNGIPSDDLPRLVCVSGRTESAVAKILDDLESRTVDAELIRLLHAIHDDDITGHVVRGYSLLESTPVKSKSVSRDIQYFSGVKRPVWFVYSGMGSQWAGMATQLMRIPVFAAAINKCHKALEPKGVNLIKTITDPDPSIYDNILNSFIGIAAVQIGLTDVLKAVGIEPDFIIGHSVGELGCAYADGCFTAEQMILSAYSRGLASIETPFIKGSMAAVGLGYAQIKSIIPPEIEVACHNGPDSSTISGPAEIMKSFVAKLVAQGVFAKEVPCSNIAYHSKYIADAGPKLLQYLREVIPQPKSRSERWVSTSVPQMLWKDPRATLSSAEYHTNNLLSPVLFEETARLIPGNAITIEIAPHGLLQAILRRSLKKDVLNIPLTEKKHADNVQYFLTALGKLYEAGLNPHLANIYPHVPFPVSQGTPMLSHLVEWEHSEDWYVTSYKAQDKMKSGERTVRMSIVDEDSEFMAGHVIDGRNLYPATGYLVMVWETLGMMLGELYTEVSVVFENVRFQRATNIPKDGNLEFIVMIQKGSGQFEIVESGASIVTGRIYSKKNVGQDFRVLPASPESSGPNVKHLLTKDFYKELRLRGYQYSGLFRGIIGCNVEGTRGRLAWQNNWVAFMDCMLQLKLIGQDTRGLFVPTRIERLSIDANMHYDAISKMNPDSARQSFEIRVYPEVEVIRAGGVEIRGLHATPISRKNPLGVPVLEKNVFVPNFGKSKMKIEDILRANIQLVLENIQTYKVKAIEVIDEEYKTNNYEPIMEIVGEVLGDLPLIQADMTVFSEEPIEMPSNIVVENKKATAESNTIIYIGANLLRRPEVLKPALSTLRDKGFLISREKDLINPKDFSDKFDIISIQDTGVEYLVLFRKRIGAKPAKFIKIVTTDDTYSWIDKVKEELKVGQKLVLYNEDEPINGLLGLVNCLRREPGGEVVHGLLIADPSAPAFNPDLEFYEEQLDKDMALNVYQDGQWGTYRHLLLGDLDVVNAHHAFVNTTTIGDLSTLKWLEGPIRENTVFKDPYSILIHVYCAALNFRDVMTAMGRVTVDAVARGRLAQECVQGFEVVGRTCNGSRVMAMVRNSGMANMIEGDKALMWNIPDEWTFEEAASVPVAYGTVYYAMVMVARVQRGESVLIHAGSGGVGQAAINVALHYGCEVFTTVGTPEKRAFIKKLFPQLKDSHIGNSRDTSFEDMIRKETKGKGVDIVLNSLADEKLQASVRCLGYRGRFLEIGKFDISNNTPIGMYFFLKETSFHGIMLDFIFDHSYEFRKSLQDLLLSGIESGAVRPLTYCTFEKHEIEAAFRYMAAGKHIGKVIIKIRDEERSHRPVKPVTLRIPATPRYMCMDDYVYVVVGGLGGFGLELTDWLILRGARRVLLTSRRGVSNGYQASRLRTWAQYKADVQVSTHDITTEEGCQKMLEMANSMGKVEAIFNLAVILKDSIFQNQTPETFKISFGPKAKATMNLDKLSRKLCPGLKDFVIFSSVSCGRGNAGQTNYGYSNSVMERICEWRHKLGLPALAVQWGAVGDVGLVADMQDEDVQLEIGGTLQQRISSCLLSLDKFLKQDAVIVSSIVVAEKKAGGSGCGNIVDAVAQIMGIKDLKTVSQQVSLAELGMDSMMAVEIKQTLEREFEIFLTAQDIRTLTFARLVELTAIREAAASTSATPRSNSSDITAGLRTFIRNFGNENLATEPFIYMPTMVSDGTETDVSVHEDEAVMFMLPGLEGCAAGMAPLCKRLKIKICALQFGAESGNDTLDDLVNSLHQKITSRLIHGKPYILLGYSFGTLPLFKLASILESEGHSGTVFCIDGSPEFLSTTITTIAEFRNDKILQNSLICYTIDLVAPNNDVTKNLMEKLHEIESYDERIDYGIKICPVQHSYSHNFIRAMSKACYNRTKMIFEHNENDVKKINAPVILLRPKEIPLPLEDNYGLDKYTEGPVTVHYLEGNHVTIIENKDCANIINKIIGGQDKVEKDAPNVVTNMIEQQRSVQV-