Monarch geneset OGS2.0

DPOGS214976
TranscriptDPOGS214976-TA4743 bp
ProteinDPOGS214976-PA1580 aa
Genomic positionDPSCF300256 - 346311-359873
RNAseq coverage78x (Rank: top 65%)
Annotation
HeliconiusHMEL0148121e-7657.27% 
BombyxBGIBMGA012149-TA2e-7053.92% 
DrosophilaCG6083-PA2e-5337.33% 
EBI UniRef50UniRef50_G9F9G35e-12249.47%Seminal fluid protein CSSFP011 isoform 2 (Fragment) n=2 Tax=Chilo suppressalis RepID=G9F9G3_9NEOP
NCBI RefSeqXP_308086.34e-6242.37%AGAP011050-PA [Anopheles gambiae str. PEST]
NCBI nr blastpgi|3640235732e-12149.47%seminal fluid protein CSSFP011 isoform 2 [Chilo suppressalis]
NCBI nr blastxgi|3640235732e-11541.75%seminal fluid protein CSSFP011 isoform 2 [Chilo suppressalis]
Group
Gene OntologyGO:00551141.7e-37oxidation-reduction process
GO:00164911.7e-37oxidoreductase activity
KEGG pathwayaga:AgaP_AGAP0110501e-61 
 K00011 (E1.1.1.21, AKR1)maps-> Galactose metabolism
    Glycerolipid metabolism
    Pentose and glucuronate interconversions
    Fructose and mannose metabolism
    Pyruvate metabolism
InterPro domain[1076-1378] IPR0013954.5e-97Aldo/keto reductase
[1078-1375] IPR0232103.3e-82NADP-dependent oxidoreductase domain
[1110-1134] IPR0204711.7e-37Aldo/keto reductase subgroup
Orthology group 
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS214976-TA
ATGTCGCCAGTCCACCACAGTGTGCTGCTCAAGTTCTTCAAAGATACAACAAGCATGTCGGAGAAAAAAGAAGGAATAGAGTCGAGGGTGCCGAAAGAACAAGAGATGCGAGGGGATAAGGAGATCCGGGAGGAGATCGCTAGAGCTGGTGAGGTGGAGATGAAACACTTCAGTCAGGAGCCGGTGATCCAGGTGACGAGAGAGGAATCCTCCTTCGTGAAGGGAGATCTGAAGGTGGAGAAGCTGGTGGTCACAGAGGACATCGCGAATGGAACAACAACCAAACACATTGTGACTGTGACGGAATCCGGTGAAGCGAATGAGGAGCAAACTGAAATATACAGCGAAATGACTGCAAGGAAAACCGAGCGTGATGACCAGAGGCGAGAAGGTTTCCAGAGAACCGTGCACAGAGTGACGAGGATACCACAACTTAAATTTAAACACACGGTCGATAAGATAAAGAAGCAAGAGGAAATGATAAAGCAAGGGAGAGATGAGGTGAGCCTTGCACTTCGAGTGGCAGAGGTTGTACATGAAACAAAAAATGACAGATCCCATTTACAACAAATTGATAAGGTGCAGTCTCCAGATATGGAAGATGAATTTGATAAAATATATGACGAAATATGCGACGGTCCACCATCAGGTAACTTCCCGACACCTGGAAAAATTGACGACTCGGACGGATTAGACAACAAATTTGAAGAAATTATGAACAATTACGATAAGGTCGAAGAACAGAAAGAGCAGGCGGGTTCAGGAACAGTGAAAAGCAAAATACCTTTGTTGAAACGTAAAAGTGAACAAGACATAGACGTCACTAAATCTACTTTAAGAAGAAATTCCCTTAAGACATCAAGTTCTGTTGATGACACGAAGCATTTTATACGTAAGTTACCAGAACCAGTCACCACAGAAAGAAGAAAACAAGAAGAATCCATCGAGGGTAATACTGTGACGTCTTTAAATAGCGATACCTGGGTTCAGTCTGAGACCGTGTCGGTCAAGAAATTTGCTTCAACAAAAAAAGTCACAAGCGCCTCAGAAACAATATCCACACAACAGAGTCATGAACAGGCCTGTGACACAGAAAATAATTTGATAACACATCGATATATTACAGGTAGCAGAATTCCAAAGCATTTTAAAAATATAAACGGGAGAGAGAAAAAAAATGAGGACAACAAAACAAATAGAAGTGAAACGAAAGAGATCAAAACAACGACCATTAAAGATTACACGATAACGTTGAACATAAAAGACAGAATATCTAAGGTTGAAACAAAACCGAATATTATAGAAAATATTAAAATACTTGAGAATAACAACGAGATCACACAAAAAGACTTTCAACAAACAGAAGCTGAAGAACAATCACACACTGACAATACACAACCAGAAACAAGCAAAGTAGAAGCGACCTCACCGACTATAATGACTACTAATAAAGAAGATAGTAATGAGGTCGTTCCAGACAAATATTTCAGAGATTTAGATACAAAAAGATCTCAAGAGAAAGATTCGAATAATACGATAGTAGAGGAACCATCCAACAAGAACAGAAACAACAACGAAACGACATCCGTCTCTCCTCGTGAGTGTCACAGAGACAACACGTTAGAAATAAGACATGACACCACTGACATACAAGATGATGACCAACTCAAAGAGGCGGTTAATGTCGTCACGTTAGAATTTCCAAATCTTTTGATTTCTGCGTCTAAAAGTTTTTCAAACGATTGTCATAGAAATGTTTCGAGAACCCTCTCTAAAGAAACAACAGAAATAAAATACGAAGGTGAAGAAAAGGTCATGATATGTGATAAAAATGTTGTAAAAGTAATTAATCAAAGAACAGAACAAAACCACAAGGAGACAGAAGACAGAGAAGATAAGAGAGTTGGAAACAACATTAATCAAGAAAACTACTCCGCTGGTAAAGAATTTACCATCGAGACGAAAAAAGAATTCCAAGAGAGAGATATCACAGGCAATAAAGAGTATATAGTATCAAAATTTAAAAATAATGAAGAGAAAATAGAGGAAGAGTCACAGAGAAACATCACGACCCATGAACAAACAGAAAAAGAAATAAACATGGAAGAAAAATATAGAGAATTTTATATCAAAAAGTTACAAAACATGATTCCAGAGAAGTTAGAGGAGCTAGAGGAAGAAGACATCATTTTATTAAAGGGGAAAGTTAAGAGAGTCATGAAAAGACTGAATTCCAAAGAGTTCAGGAAGAGACAGACAGAGACTGGAGACCTGCCGACAGACGTGTCTGTGATTAATAAGATAGCGTTGTTTGAACAATCCAAAGATTGTTTGCGAAGAGACGGTTTAGAAGAAGAAGATAAAAAAGACAAAATACATTCAAACAAACCAAAGATAGTCGACGAAGTACTGAAGGCAGAGGACCCCGAGGCTAACGAAGACGAGAGGAGTAAGAGAAATGATTCAGTGAGAGAAGAGTCCACCTTCAGCTACAACGCGTTCCTCAACTACAAGAATACTATTAAGAGAGAAATTAGTGTTCAGGAGAATAAAGCGACAGGACGCGGGGCTAAGACAGAGTGCCATGAAAAATACGAGTATAACATAAACCTGAACAACTTCACGGACGATAAGAAAAATATACTGGAGAAGATAGAAGCGGCCGCGCTGAAAGATAACAAGAGCAAAGATATCTTAAAGGAATACAAGATAAACAAAGAAAAAGATTCAGAGACGAAGCGAGCGAAGGCTGAGGCCGACGTCAACTACCAGGGCGTCCGAGGGAGAGTCAGAGAGATGGTGATGAGGATGAACTCCACAGACAGGACGGCGCTGGAGAGAAGAGAAGGAGACAGGAGGAGACGGAGGCAGGGCGCGGTGGCGGAGGCTATAGCGCTGTTTGAGGTCAGCGAAGTCGCGGGAAACGGCTGTCAAGAAAAAAACCTTTGCATATTAATGGTATCAAAATACCTGGTCGCATTTTATAATGAGATGAGGGTAGGAGACGAGACCACTCGCCCCGGGACAGTCGCTGGAACACGTCGAGAACTCAGAGGTCTAACCATGTTGGTGTTTAAAAAGGTCCCAGCTGCGCGTCAGGAGTCTGTCGCTAACGCGAGCTCTCTAGCGAAGAAAGACGAGGCGCCGGTCCAGACAACGGAAGAGGAACTGCTCCAGAAGATCGCGGAGCTGGAGCAAGCTGAGGAGAAGTATGGCTCGTACGAGAACATGACGTACGTCGAGCTCAGTGACGGCGCCAGGATGCCGACCCTGGCTGTAGGAACAGCTCTGCTGGAGAATGGTCTCATCAAACCCATCATAAAAGCAGCAATAGCTCTCGGCTACCGAGCTATAGACAGCGCCTTCGTGTACGGTAACGAGCGCGCGGTGGGCGAAGCCATCAGAGAGAAGATACAAGACGGAACCGTGAGACGCGAGGATCTCTTCATCATTTCCAAGTTGTGGAGCACGAGCCACCGCCGCGACCTGGTGCCCAGCGCCTGCCGACAGTCTCTGGACGCGATGGGGCTGGACTACTTCGACCTCTACCTCATACACAACCCCATGTCCTTCAAGGAGGGCGCCAATCTTCTACCGAAAATCGCGAACGTCCTGCAATACTCCGACCACGACTACCTCGATGCCTGGTTCGGTGTGGAGAGCTGCATCAAGCAGGGCCTCGTCAAGAGGGGAGGGGTCAGCAACTTCAACTCTGTCCAGGTCCAGAGAGTTCTGGATAAAGGGAGGATCAGGCCGGTCATTAACCAGGTGGAGTGTCATCCGTATCTGAGTCAAACTCGTCTCCACGACTACTTGGTGCCCCGCGGCGTATCTCTCAGTTGTTATGGGGTATTGGGGTCGTCGGGGACCCCGCGTCACCTGCGCTCCCCGCTTCCCCCCGTCATCAACGACCCCCTCGTCAGGACCATGGCCGCCGGGCTTGGGGTGACTCCGGCGCAGTTGCTTATTAGCTACCAGGTCCACATGTGTCACCACGTGGTGGTGAAGGCGTCCAGCGCTGGTCACCTCCGCGACAACCTGCTGTCACTGGAGCTGCAGCTGGAGCCGGCCCACGTGGCCGCACTCTCCGCCCTGAACAGGAACAAAAGGACCTTCACCTTCCAGGGAACCTCGATGTCATTTCAGGAGGGCGCCAATCTTTTACCGAAAATCGCGAACGTCCTGCAATACTCCGACCACGACTACCTCGATGCCTGGTTCGGTGTGGAGAGCTGCATCAAGCAGGGCCTCGTCAAGAGGGGAGGGGTCAGCAACTTCAACTCTGTCCAGGTCCAGAGAGTTCTGGATAAAGGGAGGATCAGGCCGGTCATTAACCAGGTGGAGTGTCATCCGTATCTGAGTCAAACTCGTCTCCACGACTACTTGGTGCCCCGCGGCGTATCTCTCAGTTGTTATGGGGTATTGGGGTCGTCGGGGACCCCGCGTCACCTGCGCTCCCCGCTTCCCCCCGTCATCAACGACCCCCTCGTCAGGACCATGGCCGCCGGGCTTGGGGTGACTCCGGCGCAGTTGCTTATTAGCTACCAGGTCCACATGTGTCACCACGTGGTGGTGAAGGCGTCCAGCGCTGGTCACCTCCGCGACAACCTGCTGTCACTGGAGCTGCAGCTGGAGCCGGCCCACGTGGCCGCGCTCTCCGCCCTCAACAGGAACAAAAGGACCTTCACCTTCCAGGGGATGGGAGACACGCACAAGAACTATCCCTTCAAGATACCTTTCTGA

Protein sequence:

>DPOGS214976-PA
MSPVHHSVLLKFFKDTTSMSEKKEGIESRVPKEQEMRGDKEIREEIARAGEVEMKHFSQEPVIQVTREESSFVKGDLKVEKLVVTEDIANGTTTKHIVTVTESGEANEEQTEIYSEMTARKTERDDQRREGFQRTVHRVTRIPQLKFKHTVDKIKKQEEMIKQGRDEVSLALRVAEVVHETKNDRSHLQQIDKVQSPDMEDEFDKIYDEICDGPPSGNFPTPGKIDDSDGLDNKFEEIMNNYDKVEEQKEQAGSGTVKSKIPLLKRKSEQDIDVTKSTLRRNSLKTSSSVDDTKHFIRKLPEPVTTERRKQEESIEGNTVTSLNSDTWVQSETVSVKKFASTKKVTSASETISTQQSHEQACDTENNLITHRYITGSRIPKHFKNINGREKKNEDNKTNRSETKEIKTTTIKDYTITLNIKDRISKVETKPNIIENIKILENNNEITQKDFQQTEAEEQSHTDNTQPETSKVEATSPTIMTTNKEDSNEVVPDKYFRDLDTKRSQEKDSNNTIVEEPSNKNRNNNETTSVSPRECHRDNTLEIRHDTTDIQDDDQLKEAVNVVTLEFPNLLISASKSFSNDCHRNVSRTLSKETTEIKYEGEEKVMICDKNVVKVINQRTEQNHKETEDREDKRVGNNINQENYSAGKEFTIETKKEFQERDITGNKEYIVSKFKNNEEKIEEESQRNITTHEQTEKEINMEEKYREFYIKKLQNMIPEKLEELEEEDIILLKGKVKRVMKRLNSKEFRKRQTETGDLPTDVSVINKIALFEQSKDCLRRDGLEEEDKKDKIHSNKPKIVDEVLKAEDPEANEDERSKRNDSVREESTFSYNAFLNYKNTIKREISVQENKATGRGAKTECHEKYEYNINLNNFTDDKKNILEKIEAAALKDNKSKDILKEYKINKEKDSETKRAKAEADVNYQGVRGRVREMVMRMNSTDRTALERREGDRRRRRQGAVAEAIALFEVSEVAGNGCQEKNLCILMVSKYLVAFYNEMRVGDETTRPGTVAGTRRELRGLTMLVFKKVPAARQESVANASSLAKKDEAPVQTTEEELLQKIAELEQAEEKYGSYENMTYVELSDGARMPTLAVGTALLENGLIKPIIKAAIALGYRAIDSAFVYGNERAVGEAIREKIQDGTVRREDLFIISKLWSTSHRRDLVPSACRQSLDAMGLDYFDLYLIHNPMSFKEGANLLPKIANVLQYSDHDYLDAWFGVESCIKQGLVKRGGVSNFNSVQVQRVLDKGRIRPVINQVECHPYLSQTRLHDYLVPRGVSLSCYGVLGSSGTPRHLRSPLPPVINDPLVRTMAAGLGVTPAQLLISYQVHMCHHVVVKASSAGHLRDNLLSLELQLEPAHVAALSALNRNKRTFTFQGTSMSFQEGANLLPKIANVLQYSDHDYLDAWFGVESCIKQGLVKRGGVSNFNSVQVQRVLDKGRIRPVINQVECHPYLSQTRLHDYLVPRGVSLSCYGVLGSSGTPRHLRSPLPPVINDPLVRTMAAGLGVTPAQLLISYQVHMCHHVVVKASSAGHLRDNLLSLELQLEPAHVAALSALNRNKRTFTFQGMGDTHKNYPFKIPF-