Monarch geneset OGS2.0

DPOGS202986
TranscriptDPOGS202986-TA4863 bp
ProteinDPOGS202986-PA1620 aa
Genomic positionDPSCF300068 - 398197-434487
RNAseq coverage968x (Rank: top 13%)
Annotation
HeliconiusHMEL0110377e-17857.54% 
BombyxBGIBMGA012331-TA4e-12353.04% 
DrosophilaCG11148-PG1e-3258.96% 
EBI UniRef50UniRef50_B4L7A14e-3455.97%GI14083 n=3 Tax=Drosophila RepID=B4L7A1_DROMO
NCBI RefSeqXP_002011405.17e-3555.97%GI14083 [Drosophila mojavensis]
NCBI nr blastpgi|2420206564e-3233.17%hypothetical protein Phum_PHUM498170 [Pediculus humanus corporis]
NCBI nr blastxgi|3838512603e-8031.00%PREDICTED: PERQ amino acid-rich with GYF domain-containing protein 2-like [Megachile rotundata]
Group
Gene OntologyGO:00055155.6e-17protein binding
KEGG pathway 
InterPro domain[915-962] IPR0031695.6e-17GYF
Orthology groupMCL20553 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS202986-TA
ATGGGCGATCGTAACAATCCTATTAAGTTCGGCCCCGAGTGGCTGCGTAATTTGGCACGCGAGCGTACTGCTGGGAGGGCAACTACAGCACAGACCACAAGGCCCGGGGCCGCAGGCGGCAGCAGGCCTGCCGGCTCCCAGGGGGCCGGCTCGACGTGCACAGGCTCCCCGGGGGCCGGCCCCTCTACCGCCACGGGAGTATCGAGTTCGGGCCCCGTCGGGGCCTCGCTGACTTCAGGACCCGCGTCGTCCGCCGCAGCTTCCAGCACGACCGCAGTCGCCGGAGCCTCAGGCACAAACTCAAGAAATACAAACACTAACCATCAGAAGATTCAGCTCGCTAAGCTAAGATACGGCAGGGAGGAGATGCTGGCATTGTACGACAGGAACGCGGAGGCTCCAGAGGAATTGAAATACTTTGACTTGTTGTACCAGCCGCGGGGGAAACCGCCTGTTGCTCTCAACACATACGACGATGATACGCGGGAGGCGCGCGGCCCGGGGGCGGGGGCGGAGCGCTACGGCGCGGGCCGAGGACGAGGAGCTCCGGTAGAAGCGCGCGGTCGCTCGCGGATGCCGTTCGTGAGGCACGCTTCCACGGTCTCCACTGGCAGGATTGATGGTACAGTATGTCTCGCCATACCAAAGTCACGTTTCATAGCCTCAGTTCAAAGGCTGTTGCACTATCAGTCGGCCAGGTTCGCTCTCGTTCCGCTTATCGAAGAGCTCTACTCTGGTCTCAAGGGAATTGCCTCGGGGGCAACATCATTGCATTCCGGTATTCCCCGAATGCCAGGGTATAGCGGTGGGCTCGATGAGGAGGGCCCCAGTAGACCTTGGAGTAGCAGCAATAACAGCGGTTCACCCAGAACTGATCAAGGCGATTGGACGACTAATAAAATGTTTCGCAGGCGACAAGCAAATAATACTAATTGGAGGCAAACGTCTCGCGACGAAGGAGACGAGTGGCGTCAGGAGAATTCCAGACCTCCCAATCGTTCTAGTGTCGATAAATGGGACCGTGATTGGAGCGACCGGCCGTCCGGGGAGAGGCCGCAGTCCTGGGCTCCCAGCCGGCGACAGTGGCCCGGTGACTCCAACAACGACGACAACTTGCCAGAGTGGGCGGTGGACAGTGCGGAGGCTGGTGCTGGTACCTTCGACTCCTCGGGAGCCTTCCACGGGTATAGCAATGACGATACAAACATACCTAAGTCACAGGAGTCCACTTACCCGTTGACTCGGTCGCATACACACGGTGGTAGTATTGCGCGCTCTAAAACTGTGGAAGAAGGCTCTGAAGAGTGGTGGGCTTCAGAGAAAGCCAAGAAGCTATCACCGAAAAGGTTTGAGGCTGGCGATAGTAGATATAAAAAGTCTTTGAGTACTGGAACAGACGAAGTCAGCGGTGGAGCGGTAAGTGTCAAACGGACCGATAACACGGAGAAGACTAACGATCTAGAGTCGTCGGAGAGTGTAGACACGCCGGAACCGGAAGCGGACGCCAGTAACGCACAGGCAACTACTGACGAGCAGAAACAAAATGATTTGAGACAAAAGCTCTCGGATAGTAAGACATTTGACGCGTTTATGAGATCAGATATAGAATACCCTGAACCCAACGAAGACAAAGGAAACTTCCAGTCCGTCATGATCAATTCCAACAACGGCTTGCGGCAGAAACATCAGAATATAGTGACGGTGAGCAACGAGACCGCCATGAGCCGGCAACAGATGAACGCCACCGGCCTGTTGCAGATGCTGCACGGCAGGCAAATGGGCGATCAGAACCCTGAAGAGGAAACTTCTAAAACCAACGAGGAAAAGATCGTTGAAGATCTTATGGACATGACTTTGGAGGACGGTCGCATGCGTCCTAATCCAGCGCACCAACCCGGTGTCATCGCATCAGGAATGATTAATCAAAGCCAGCTGCTACGTATTGCCAGCCCCGCGGTGCCGCAGCAAGGGATGGTTCTTAATGCGGGACAAGGGATACAGAATGTTGGCATCCCCAACCAGGCGCTCAACTCTTCGCTAGGACTGAATATGGGCCCAGGAAATGCTCACAGCTTGCCCATGCAAGGATTGCTACCACCAGTGATGAATACTCTGAATCCCGCTATGGGCACGGCCATGCAAGCCCGCGTTATTGGAGCGTTCCAACAGAACGCCGGACTGCCGGTAATGCCAAGTCCTAACGTCGCTAATAATTCTCTATTCATGGGACAAAATAACTCTCAGCAACTACCAAGCGGTGATATGCAGATATCGACACACACGGCTCAGAGCAACCTGTTCCCGATGCACGGGATGCAACATGGCAACCCCGGATTCAGCTCTATCTACGGCAACATTATGCCGCCGACAAATATGGGGGGCAACATGTCGACTAACATCGGTGCTAATATGCCAAACAGTCTCGGCCCCAATATGAATACCAATATGGCTGGGAATATTGGGAGCAACATCGCCGCCAATATGGGTAACAACATTGGCGGGAACATTGGCGCAAACATTAGCGGGAACATAGGCGGAAACATCGGCGGAAATATTGCCGGTAATATAGGTGGAAATATTGGTGCCAACATTGCTGGAAACATTGGAAGTAATATAATTGGCAACATTGGCGGCGCCATTGGTGGGACGATTGGCGGTAACCTCGGTGGTAACATTGGAACCAACCTCAACGCTAGCATTGGTGGTAACATCGGTGGTAACATAGGCACCAACATGGCCGATCAGTGGTATTATGAAGACCCCAAAAAAGTAGTCCAGGGTCCATTCTCGTCTAAGGAGATGTACAGCTGGTATAGGGCGGGTTTCTTTAGCCCCAGCCTGATGGTGCGTAGGGCCTGCGAAACTCATATGCGTCCGTTAGGCTCGTACGGGCCCGTGGTACCGTTCGCGCAAGTGGAGGTACTTCCGCCATATCCGATTACTGGATTCGAACCCCGACCCCAAAATCATGAAATGCTAAATCAGCAGCCGGCTCTCACTATGGAAGAGTCGCTGTGGGGTCAGCCGGCTACCAATCAAGATTTGTTGTGGATGCAGCAGATGCCTCGCGATCGCGGCAACAATCTGCCGATGTTCTTCTGGGATCAGCCATCCTCCGCTATATCTTCCAATGCCTTATTGCCCGAGGAGATAGCTAAGGAGATGAAAACAGAGGATCAGATCCTCGCACAGCTCCGGGCCTCCCAGAACCTCCCCAACCCGGCACCCTTTCTGAACGATACCCCCAGCTCAAGCTCCACAGCTTTGAGTGAAGAGTCTTATACCACGAACGTCAGCTCGACACCGGATCTCAAACAGCTGCAAAAGTTGATGATAAGCGAAAAACTCGCTCCTCAACCAAGGGATATCAAAGCTTCTAGCGTAGAGCGAGAGGCTAAACCTGAGAAACCAAATAAGAAGGATCAGAACACGACTGAGACCATTGCTGCTAAGACCCAGCCTACAAAGGCCGAATCAAAGGCTGCCAAGCAATCCAAGACCGAGAATGAAAAGGCCAAAAACAAAGAGACTACCACAAAGAGTAAGAAACAAAAGGCCAAAGAAGAAAAGAAAGAGGAAGAGAACAAAGTCAAAGAGGATGACAAAGAAAAAACGACACATGAAATTTCACCGACTAAAGGCAAGAAGGAAGACAAAATGAATAGGAAGGAATTAGAAAAAGAGAAGAAGGAATGGATCAAGGAAGGATTCACTATTGTGAAGGGCCCTGAGAAGGAAAGCAAAAAGGAAAATAAGAAAAAACTAGAAGAAGCCAAGGCCGCTGAAGAAGCTGAACGCAAAAAGAAAGACGAGGAGAAGTCAGTGACCGAAGAAGATAAAAAGAAGAAAACAGTAGAATCAAAAAAGCAGCAGGAGCATCCACAACGGAACATAGAGACAAAGAAGGCGCCCTGGTCGGCACCACAGATAGGACAGTTGCGTGACGGACTACCGCTGGGAGAGATTCAGCGTTTGGAGAGAGAAAAGAAATTAGAGCAGATCAGAGAACAGCAGCACATGGTACAACTGCTCGCGCAGGAGCAGGCTGCCGTCGCCGCCAGGGAACAGGTTATCAATGAGATGCAGGCGAATAATCCGCCGTGGACCAAGAAGAAAATTGACCGCCCCAACAACGGAACCAGCCAGAGCTTTGCTGATATTCAGGCGGAGACACGTCGCCAAGGAACGGCTTCCGCTCATCCTCCACCGATGCCAGTGGAGGATACTCTGACGACCAGCAGTCAGGCGCCATGGGCCAATACCCAGAACGGAGCACGATTAACCAAAAATCTGTTCTTCACTGACGATACAAACAATCCAGCAGATGTGTTGAACACAGGAGGATTCTGGGATACACAGCCGAATACGTCGAAAGCTGCTGAGAAAGCGAGGGACAATAGACCCGAGACCAGCAAGAAGAAGAAACCAGCGGTCGCCGCGTCGCCAAAGAAGGAGAGCTCTCCGTGTGCTGAATTTGACACTTGGTCCCAATCAGCGCTCGCTTCCTGGAGCTCCAAGATTGATGTGCCAACATTCGTCGGCTTTCTGAAGGACATCGAATCGCCCTACGAGGTGAAGGACTACGTTAAATGCTACTTGGGCGAGTCCAAGGACTCCAGCGACTTTGCGAGGCAGTTCCTCGAGAAACGATCTAAACTACTCCGTGTTGGGATGGTGACCCCCTCCGATGATCTCTGCTCACCAGCTATGGCTGTCAATCCGCGAGCCGCACTCGACTACCAGGAGGGGAAAGGCAAAAAATCAAAGAAGAACAAGATGTTAAAGGTGGACGCGCGTATACTGGGCTTCTCCGTGACAGCCTCCGAGGATAGGATCAACGTGGGGGATATCGACACCGTTTGA

Protein sequence:

>DPOGS202986-PA
MGDRNNPIKFGPEWLRNLARERTAGRATTAQTTRPGAAGGSRPAGSQGAGSTCTGSPGAGPSTATGVSSSGPVGASLTSGPASSAAASSTTAVAGASGTNSRNTNTNHQKIQLAKLRYGREEMLALYDRNAEAPEELKYFDLLYQPRGKPPVALNTYDDDTREARGPGAGAERYGAGRGRGAPVEARGRSRMPFVRHASTVSTGRIDGTVCLAIPKSRFIASVQRLLHYQSARFALVPLIEELYSGLKGIASGATSLHSGIPRMPGYSGGLDEEGPSRPWSSSNNSGSPRTDQGDWTTNKMFRRRQANNTNWRQTSRDEGDEWRQENSRPPNRSSVDKWDRDWSDRPSGERPQSWAPSRRQWPGDSNNDDNLPEWAVDSAEAGAGTFDSSGAFHGYSNDDTNIPKSQESTYPLTRSHTHGGSIARSKTVEEGSEEWWASEKAKKLSPKRFEAGDSRYKKSLSTGTDEVSGGAVSVKRTDNTEKTNDLESSESVDTPEPEADASNAQATTDEQKQNDLRQKLSDSKTFDAFMRSDIEYPEPNEDKGNFQSVMINSNNGLRQKHQNIVTVSNETAMSRQQMNATGLLQMLHGRQMGDQNPEEETSKTNEEKIVEDLMDMTLEDGRMRPNPAHQPGVIASGMINQSQLLRIASPAVPQQGMVLNAGQGIQNVGIPNQALNSSLGLNMGPGNAHSLPMQGLLPPVMNTLNPAMGTAMQARVIGAFQQNAGLPVMPSPNVANNSLFMGQNNSQQLPSGDMQISTHTAQSNLFPMHGMQHGNPGFSSIYGNIMPPTNMGGNMSTNIGANMPNSLGPNMNTNMAGNIGSNIAANMGNNIGGNIGANISGNIGGNIGGNIAGNIGGNIGANIAGNIGSNIIGNIGGAIGGTIGGNLGGNIGTNLNASIGGNIGGNIGTNMADQWYYEDPKKVVQGPFSSKEMYSWYRAGFFSPSLMVRRACETHMRPLGSYGPVVPFAQVEVLPPYPITGFEPRPQNHEMLNQQPALTMEESLWGQPATNQDLLWMQQMPRDRGNNLPMFFWDQPSSAISSNALLPEEIAKEMKTEDQILAQLRASQNLPNPAPFLNDTPSSSSTALSEESYTTNVSSTPDLKQLQKLMISEKLAPQPRDIKASSVEREAKPEKPNKKDQNTTETIAAKTQPTKAESKAAKQSKTENEKAKNKETTTKSKKQKAKEEKKEEENKVKEDDKEKTTHEISPTKGKKEDKMNRKELEKEKKEWIKEGFTIVKGPEKESKKENKKKLEEAKAAEEAERKKKDEEKSVTEEDKKKKTVESKKQQEHPQRNIETKKAPWSAPQIGQLRDGLPLGEIQRLEREKKLEQIREQQHMVQLLAQEQAAVAAREQVINEMQANNPPWTKKKIDRPNNGTSQSFADIQAETRRQGTASAHPPPMPVEDTLTTSSQAPWANTQNGARLTKNLFFTDDTNNPADVLNTGGFWDTQPNTSKAAEKARDNRPETSKKKKPAVAASPKKESSPCAEFDTWSQSALASWSSKIDVPTFVGFLKDIESPYEVKDYVKCYLGESKDSSDFARQFLEKRSKLLRVGMVTPSDDLCSPAMAVNPRAALDYQEGKGKKSKKNKMLKVDARILGFSVTASEDRINVGDIDTV-