Monarch geneset OGS2.0

DPOGS215948
Transcript: DPOGS215948-TA, 2736 bp
Protein: DPOGS215948-PA, 911 aa
Genomic position: DPSCF300308 + 106881-111625
RNAseq coverage: 197x (Rank: top 47%)
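The transcript length, protein length, and genomic coordinates above can be cross-checked with simple arithmetic. This is a minimal sketch: it assumes the coordinates are 1-based and inclusive and that the transcript length counts only coding sequence plus one stop codon. The record states neither convention, and the variable names are illustrative.

```python
# Values copied from this record.
transcript_bp = 2736
protein_aa = 911
start, end = 106881, 111625

# Assumption: genomic coordinates are 1-based and inclusive.
genomic_span = end - start + 1

# Assumption: the transcript is coding sequence only, with one stop codon.
expected_cds_bp = (protein_aa + 1) * 3

print(genomic_span)                  # 4745
print(expected_cds_bp)               # 2736
print(genomic_span - transcript_bp)  # 2009
```

Under these assumptions the transcript length matches the protein length exactly, and the 2009 bp difference between the genomic span and the transcript would be consistent with intronic sequence.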
Annotation
Heliconius: HMEL004162 | 0.0 | 48.10%
Bombyx: BGIBMGA001854-TA | 3e-88 | 46.52%
Drosophila: CG8414-PA | 1e-43 | 26.19%
EBI UniRef50: UniRef50_B4KQB2 | 7e-45 | 28.11% | GI20408 n=1 Tax=Drosophila mojavensis RepID=B4KQB2_DROMO
NCBI RefSeq: XP_002005305.1 | 1e-45 | 28.11% | GI20408 [Drosophila mojavensis]
NCBI nr blastp: gi|195121594 | 2e-44 | 28.11% | GI20408 [Drosophila mojavensis]
NCBI nr blastx: gi|195121594 | 1e-44 | 23.95% | GI20408 [Drosophila mojavensis]
Group
KEGG pathway 
InterPro domain: [733-858] | IPR010655 | 1.2e-13 | Pre-mRNA cleavage complex II Clp1
Orthology group: MCL14438 | Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS215948-TA
ATGGAATTTTTCGAAAAAGCACACGTTGCCTGCAGCAATGCAACATCTTTAAATAATAAGTCAAAAAAAAATGTTAAGAAACAACTGAAACAAATGTTACATGGCTATAAACAAAGTGATATTCATCTTATAAAGTCATCGGCTCTCGACAAAACAGAGTATGATAGTAATTCTACATCAGTTAGTGGATATTCAGACTTAAATCTGACCGATTCTGGCGACGATGAACATTCAACTCGGAAGGAGTTGGTAGAGAGTGTAACAGTAGACGAAAATGATTCCTTGGATAGAAGTGCCAGTAATACTATACATAAAGTTACTTCATATAAGTCTTCTGAAATTAGTGCTAGCAAGGATAAAAATTCTATTACTGAAGAATCCGATGATGTCTTTTCACTAGTAGACAGTGAGGCATCTAATGCAAATAGTCTGATTAAGTCAGAAAGTGAATCAAACAGTTCAGAACCATTTGACCCAGATGGTTTACTGGCTGCAAAAATTATTCAAAGACTTCAAATCAACAATGAAAAGAAAAAAAACCTCAAAAGAAAACATGATATTGAATCTAAGAAGCATAAAGTCTCAAAAGTCATTGAAAGTAAAAGTAAATCTGTAAAAGGTCCCGTCATTCCATCAGTATCTGAAGAAGTTCAACAATTCTCCGATGACGACTCATCTGTCTGTGCCATAGCTGTAGAAAAAGATTCCTCTGAATTTTCTCCTCCATATGTTTCTATTTCAGCCGTTGATGTACCACTTCTAAGTTTCACAGATGTTTTAGGAGATTACAAACATACAACTATAAAAAACATACCATCCCAACAAAAATCTGTAACAGTGGATGAGTATTCAAATTTTATTGTAGATAAATCTTTGAGCGTTGACGCTAAAGTTGAGACAGATATTGGAAGTTTGACTCCTGAACCTTCATCGGATAACGAGATTGGTGAAATAGAGGAAATAATCAATCTTGACTCAACAACAGAAACGATAGAAACGGATACTCCTATACCAACTAATGTACCTTTTGATGATACCATGCAATCGGAACCGTTGAGCGATAAAGAATCTTCAACGGACGAGGATTACTACAAGGTTATAGATACATTCAAAATATATTATGGCAACAAATGCTGCATCATCATATTGAAACATCCAACAGAATTGTTTGTTCAAGGAAAAGTTGCAATAAAAGTTTTAGACGGCTCAATTGATATATTCGGTTACACCCCAAAGGACGATACTTGTAAATTATACGCTCCCTTTTACAGCTACGCTCACAGTATAAAAACTACCGAAGAACCAAATGATTATTATGGGTTGTTCGGGAAATTAACTGAAGCCGGTCTCTCGGTCGCAGAGGCTGAAGAAATAGTGATTACTATTGGGGAACACGATGGAGTCGTACTTTTAAAGCCGCTTGTGAACCAGTGTTTAGATTTTGTAGAAAATAATTTTAAAATAACAAACTTGTTTGTAAGATCTGTGAAAAACATTGAACCCTTCTTTACAAAAGCAACTGATATTTTAAATTGTTCTCTATTCTCAATAAGGCCAGCTAGATGCTTCAAAGTCCATCCCAGTTGGAAAGAGGCACTTAAATATTCTCAGGAAAAACATAGCCGTGGGATTATTTGTGGCGGCAAAGGTGCTGGAAAATCTACATATTTGAGATACCAAGTAAATAAATTAATCTCTCAAGGACCAGTTTTAGTTGTGGATCTTGACCCTGGGCAGTCCGAGTTCACAGTGGCCGGTGGTATATCAGCTACTACCGTGTCCGAACCACTATTAGGGCCAAGTTTCACACATCTAAAGAAGCCAGACATAATGTTTAATATCGGCATGATAAACACAATGGACAATGCGAGGCGTTATGTTGCAGCACTGCAGCAATTGTTATCACACTGCCGCAATCACAAACCGTACTCCGAAATGCCGTGGATAGTCAACACAATGGGGATGACAAACTTCCTAGGGCTCAAGTTCATAACACTTAT
CGTAATATTAACGCAGCCAACGTATCTTCTGCAATACGAATCCAAGAATTCTAAACGAAGATTTGAAAGCTTCTTAAGACCTTCCAACGTAAAACTTGTATTCCAAGATAATGAAAGCGATCCCTTGTTCAGCAACATCACCTTCCCGGAGCAGTTGAATTATAAGTTTGTAGTGGCAGACGAGGCGGATAGCTTCTTGAAAAATGGCTACTCTTTATCACCTAGAGACGAAAGATACCTAAATTTCTTGGCCTATTTTGGTCAATTACTAACTGTCCACAAACTAAAGAGTTTGCTTGAAATAACGCCTTATCAGGTGAACTTGAAAGATATCAACGTCGCTACCAACGTGATCGTAATGAAGGAACGTATTACAAAAGTTATCAATGGACAAATCGTGGCACTGTGTCAGCTGTTGAGACAATGCGACAATAGAGTTTTTACATTAGACGATAAACCATTCGTATGTTATGGATACGGTATTGTCCGAGGAGTTGATTGGGATAAGGAGGTTCTATATATAATTACACCATTGGAAGGCGATTTCTTAGCATGTGTGGATACTTTGGTGTATGCAGACTGGAGTCCCGAGTTAGTGGGGCTAGAGACATGTCTACCGAATGGCACAAGCATCCCCTACCGCACCTACACAAGAAACAAGCATATACAGCTCATGTCCACACCAAAGAGGAGATTCAATCCACTTCAGCTTATAAAGATGACAAGGAATGCCTAA

Protein sequence:

>DPOGS215948-PA
MEFFEKAHVACSNATSLNNKSKKNVKKQLKQMLHGYKQSDIHLIKSSALDKTEYDSNSTSVSGYSDLNLTDSGDDEHSTRKELVESVTVDENDSLDRSASNTIHKVTSYKSSEISASKDKNSITEESDDVFSLVDSEASNANSLIKSESESNSSEPFDPDGLLAAKIIQRLQINNEKKKNLKRKHDIESKKHKVSKVIESKSKSVKGPVIPSVSEEVQQFSDDDSSVCAIAVEKDSSEFSPPYVSISAVDVPLLSFTDVLGDYKHTTIKNIPSQQKSVTVDEYSNFIVDKSLSVDAKVETDIGSLTPEPSSDNEIGEIEEIINLDSTTETIETDTPIPTNVPFDDTMQSEPLSDKESSTDEDYYKVIDTFKIYYGNKCCIIILKHPTELFVQGKVAIKVLDGSIDIFGYTPKDDTCKLYAPFYSYAHSIKTTEEPNDYYGLFGKLTEAGLSVAEAEEIVITIGEHDGVVLLKPLVNQCLDFVENNFKITNLFVRSVKNIEPFFTKATDILNCSLFSIRPARCFKVHPSWKEALKYSQEKHSRGIICGGKGAGKSTYLRYQVNKLISQGPVLVVDLDPGQSEFTVAGGISATTVSEPLLGPSFTHLKKPDIMFNIGMINTMDNARRYVAALQQLLSHCRNHKPYSEMPWIVNTMGMTNFLGLKFITLIVILTQPTYLLQYESKNSKRRFESFLRPSNVKLVFQDNESDPLFSNITFPEQLNYKFVVADEADSFLKNGYSLSPRDERYLNFLAYFGQLLTVHKLKSLLEITPYQVNLKDINVATNVIVMKERITKVINGQIVALCQLLRQCDNRVFTLDDKPFVCYGYGIVRGVDWDKEVLYIITPLEGDFLACVDTLVYADWSPELVGLETCLPNGTSIPYRTYTRNKHIQLMSTPKRRFNPLQLIKMTRNA-
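The relationship between the two sequences above can be spot-checked by translating the ends of the transcript. This is a minimal sketch that assumes the standard genetic code applies (the record does not name a translation table). The `translate` helper is illustrative, and it writes stop codons as `-` to match the protein record.

```python
from itertools import product

# Standard genetic code, with codons ordered T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds: str) -> str:
    """Translate a coding sequence; stop codons are written as '-'."""
    usable = len(cds) - len(cds) % 3  # ignore any trailing partial codon
    codons = (cds[i:i + 3] for i in range(0, usable, 3))
    return "".join(CODON_TABLE[c].replace("*", "-") for c in codons)

# First 21 and last 15 nucleotides of DPOGS215948-TA, copied from the record.
head = "ATGGAATTTTTCGAAAAAGCA"
tail = "ACAAGGAATGCCTAA"

print(translate(head))  # MEFFEKA
print(translate(tail))  # TRNA-
```

Both fragments agree with the start (`MEFFEKA`) and end (`TRNA-`) of DPOGS215948-PA.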