Monarch geneset OGS2.0

DPOGS215834
TranscriptDPOGS215834-TA3417 bp
ProteinDPOGS215834-PA1138 aa
Genomic positionDPSCF300073 + 412057-428614
RNAseq coverage1836x (Rank: top 7%)
Annotation
HeliconiusHMEL0116440.079.54% 
BombyxBGIBMGA013563-TA0.076.92% 
Drosophilaby-PA6e-7247.20% 
EBI UniRef50UniRef50_E0VF330.039.79%Tens, putative n=1 Tax=Pediculus humanus corporis RepID=E0VF33_PEDHC
NCBI RefSeqXP_002424660.10.039.79%tens, putative [Pediculus humanus corporis]
NCBI nr blastpgi|2420076880.039.79%tens, putative [Pediculus humanus corporis]
NCBI nr blastxgi|2700067350.039.05%hypothetical protein TcasGA2_TC013103 [Tribolium castaneum]
Group
Gene OntologyGO:00055153e-26protein binding
KEGG pathwaydpe:Dper_GL122924e-57 
 K00665 (FASN)maps-> Insulin signaling pathway
    Fatty acid biosynthesis
InterPro domain[999-1134] IPR0136253e-26Tensin phosphotyrosine-binding domain
[884-981] IPR0009802.9e-22SH2 motif
[995-1136] IPR0060207.2e-15Phosphotyrosine interaction domain
[247-374] IPR0089733.9e-14C2 calcium/lipid-binding domain, CaLB
[302-371] IPR0140207.3e-10Tensin phosphatase, C2 domain
[993-1127] IPR0119931e-09Pleckstrin homology-type
Orthology groupMCL12073 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS215834-TA
ATGAGACCACGTGTCAAGAAGGAATCTGAAGCAGCTAGGGCGCTCCATCGACTTGCGTTCGCGTTGTGGCGGGCGGCCTTTTATTTGCAGCGCGGACCCTCACGAGCACATACTTCGAAGCAAGGAACCTTGACCCGAGCCGTCAGCGCCCCAGCAACCCCGGCCACTGCTGCTTTTGAAAGGGAACATGAAGGCCTTCGATCTGTCAGTTATCCTGGTCCAGGTGGATCTGGGACCCGGTTAGACCTGTGTTACGTTGCGGAGCGGATGCTGGCTCTACATCTTCCTGATAGAGACGCACATGCAGAGGCACAAGCCGCACACATGCTTAATAATAAACACGGAGAACATTATATGGTATTCGAAGTAAGTGGAAGTGATACGAGTGGCGATGGTCGAGTGGCTCGAGGAGTGTTCGGTCGCGCTCGGTCCCTGGGCTGGCCGGGGGACCTGGCGCCGCCGCTGGAGAGACTCTGCGCCGCCTGCAAACACATCGAGAGCTGGCTCGCGGCTCATCCGAAGAATGTCGCTATATTACTCGCTTGGGGTAACCGCGAACGTCTAGGAGTGCTGGTGGCGGCTTACATGCATTACTCCGCTATCTGTGGGGCGCCGGAGCACGCACTAGATCGATATGCGATGAGGAGATATTTGGACGACAGAGTACCGATATTCCACCTGCCTTCTAACAAACGATACATAGACACGTTCGCGGGTTTGCTCGCCGGGCAGATTCGCGTCAACGCTGCACCTCTACACCTGACGCACGTGAGCGTCGCGGGTTCCCTGGCAGCACCAACCACTCCGACCATCGCCTTCCTTAAGATCTATGAGAATTACAACTGCGTCTATACATCTGGTCTGTACATGGTGAATGGCGGTGGATGGACGGTGGGCGTCGGCCGGCTGGCTCTGCGCGGCGACGTGCTCGTCCGAGCGTACCGCCGGCCGGCTCACACCAACACACACACACAGCGCCACCTGGTGTTCGCCTGTCAGTTCCACACGTGCGCTGTCGCTGACCACACACTATCCTTCACTAAGCAACAACTTGACCACGCTGTGCACGATCCGTCATTTCCGTCGGACGGTGCTGTGGAGCTAATCTTTGCTCGGGGGGAAATGACACAGATGTTAGGAAGAGCGCCTCGTCCGCACATCCTCTTCCTGGGCCAGTATTTCGTAACAGTAGCCTTCTCCGGCGACTGTGATAACAACGGCAGCGCCGAGGGTTCGGGCTCCGGCGGGGAGACGGGTGAGGGGGGTGAGGCAGCCCATACGTTCGGACCTCTCGACGGGTCCATCTACGCGACGGTAGTGCGAGCGAGCGGAGGCCCATCCTCTCCGCTCAGCGCCTCCATGGACTCGGGCATCTCATCAGCGGGCCGCCGCGCTGCCCCCAGTCCACCCGATGAACTGGACGCGTTACTTGGAGATATGTTGAGAACTGTGAGCGCGCTGCCCGATCCGCCGCCAGCGCCGCAGTCGCACGCCGATCGTGCTCCTGATATACCGTATCACGCGCGCGCTGACTCTGCCCCATTCACGTACGGAGCTCCTGGTCTGAGACCCGGCATGCTGCGAGCCTCCAACCGGCTCGCTAGTCCTGAGTTGGTGCGGAGGGCTCTGGGCACCGACCGGAACTATAAGGGGATAGAAGATGATGATGACGACCGCATCCTCACCCCGGAACCTTCACCTCGCAACCCACTGTCACCACGCACAGTACGGAAATTTAGTCACGAAAATGTCACTACTACTACTGCTACTACTACTACAAGCATTCCTCCACCGACTTCGCCTCTACGTAACGGAAGATGGGCCAATGGTGATACGGAATGGATAGAGCGTCCGCCGAGTAGCGCAAGGAAGACTCCAATTGAGAACGGTACTCTGAGATCAAGCAGTACCTTAGGTTGGTACGAAAGCAATCGTTTTAGAGAACCGAGGAAATCTGAGGGTAACAATAATGTTGAAGCAACTTCCGGGCTCACGTGGTTACAAAGGCAACAGCAAAAGTTAAGAGAGAAAAAAGAAGCCAGGGAAAGAGTCGCTCGACTGCCGCTCGAGTGGGACGCACCTTCCCGACACGTGAGGCGATCAGCTAGCCATCGTGTGGACGGTTACACTAGCGATACTACAGCGTTTGCCGATGATGACGACGACTTTAGTGTACCCCTGCACGTCAACACACGTACTCTTAACAGGACTGATAGCATCAGCCCTCAGGCTCCAGACAGGACCTCATCTAGAAAATTCATGTATGAAAAGATGACGATGCGCGAGTGGAGTACTAACACAACACCGACGAGTACACAAGCGCCTGGTTTACTGTCGCTGGCGGAGCCGAACAACAACGATACCATTAGTCGAGAAAGGACAGAGAGCTGGTCTAGAAGCGAGTCCCGCGCGGGTACGCCGGCTTTTCCAACCCACCCCCGAACACCGTACCCTCCCACACCAACACCCTCGCTATCCGCTAGACCACCAAGATCACCCACCGTCTCCAGAAAAGAACGTGAAGGCAGCCCGGAGTCGGAATATCGTACATACAATGGCAGCGTCGGATCTCGGCGGTCGAGTGTGAGCGGCGGTGCGGAGCCCCAGCATGTGGCGCCTGACCGCGTGCGATTCGCGAGAGATACCAGTCACTACTGGTACAAGCCCAATATATCTAGAGATGACGCGGTGACAGCACTCCAACAGCTGGAAGAAGGTGCGTTTATAGTGCGTGATTCCAACTCCTTCCCCGGCGCCTTCGGTCTAGCTGTCCGTGCGGGAACGGGAGTCCGTCACTTCCTCATCGAGCCTACAGCCCGAGGAGTCCGACTGCGAGGGTGTCCCGATGAGCCTGTATTCGGGTCCCTATCAGCGCTCGTGTATCAACACACTGTTACACCCCTCGCCCTGCCTGTGCCACTCAAACTGCCTGACAGAGACCCGTGGACGGGCGGCGCGAGTGCAGCGGCGCGTGCGTTATTAGCTACAGGCGCGGCCTGCAACGTATTGCTACTGGGCTCTGAGAACACCGAGGCTCTCACCGGACCGGCCGCTGTCAAGCGAGCTGTACAGAACATATTGCAGAAAAAATCGCCAGCCCACGTGGTTCACTTCAAAGTATTCGGTGGCGGCATCACACTGACGGACGCGGCTAGGAAATTGTTCTTCAGGCGCCACTATCCGGCTACTGGTGTGTCATACGCCGGCATAGACCCTGACGAGCGTCGGTACAAATACGTGGACAATGGAACCCAGACCGAGAAACGTATCTTCGCCTTCGTCGCGCGCGCCTCTTCCGGCGCCGACAACCAGTGTCACGTATTCGCTGAGTTGGAACCAGAACAGCCAGCTACTGCTATTGTCAACTTCGTAAACAAGGCACTTCTCGGTAACACTCAGAAACAGGACATAATATAA

Protein sequence:

>DPOGS215834-PA
MRPRVKKESEAARALHRLAFALWRAAFYLQRGPSRAHTSKQGTLTRAVSAPATPATAAFEREHEGLRSVSYPGPGGSGTRLDLCYVAERMLALHLPDRDAHAEAQAAHMLNNKHGEHYMVFEVSGSDTSGDGRVARGVFGRARSLGWPGDLAPPLERLCAACKHIESWLAAHPKNVAILLAWGNRERLGVLVAAYMHYSAICGAPEHALDRYAMRRYLDDRVPIFHLPSNKRYIDTFAGLLAGQIRVNAAPLHLTHVSVAGSLAAPTTPTIAFLKIYENYNCVYTSGLYMVNGGGWTVGVGRLALRGDVLVRAYRRPAHTNTHTQRHLVFACQFHTCAVADHTLSFTKQQLDHAVHDPSFPSDGAVELIFARGEMTQMLGRAPRPHILFLGQYFVTVAFSGDCDNNGSAEGSGSGGETGEGGEAAHTFGPLDGSIYATVVRASGGPSSPLSASMDSGISSAGRRAAPSPPDELDALLGDMLRTVSALPDPPPAPQSHADRAPDIPYHARADSAPFTYGAPGLRPGMLRASNRLASPELVRRALGTDRNYKGIEDDDDDRILTPEPSPRNPLSPRTVRKFSHENVTTTTATTTTSIPPPTSPLRNGRWANGDTEWIERPPSSARKTPIENGTLRSSSTLGWYESNRFREPRKSEGNNNVEATSGLTWLQRQQQKLREKKEARERVARLPLEWDAPSRHVRRSASHRVDGYTSDTTAFADDDDDFSVPLHVNTRTLNRTDSISPQAPDRTSSRKFMYEKMTMREWSTNTTPTSTQAPGLLSLAEPNNNDTISRERTESWSRSESRAGTPAFPTHPRTPYPPTPTPSLSARPPRSPTVSRKEREGSPESEYRTYNGSVGSRRSSVSGGAEPQHVAPDRVRFARDTSHYWYKPNISRDDAVTALQQLEEGAFIVRDSNSFPGAFGLAVRAGTGVRHFLIEPTARGVRLRGCPDEPVFGSLSALVYQHTVTPLALPVPLKLPDRDPWTGGASAAARALLATGAACNVLLLGSENTEALTGPAAVKRAVQNILQKKSPAHVVHFKVFGGGITLTDAARKLFFRRHYPATGVSYAGIDPDERRYKYVDNGTQTEKRIFAFVARASSGADNQCHVFAELEPEQPATAIVNFVNKALLGNTQKQDII-