Monarch geneset OGS2.0

DPOGS208888
Transcript: DPOGS208888-TA (2718 bp)
Protein: DPOGS208888-PA (905 aa)
Genomic position: DPSCF300009 - 959585-965296
RNAseq coverage: 47x (Rank: top 71%)
Annotation
Heliconius        HMEL003886          2e-44   41.54%
Bombyx            BGIBMGA002440-TA    3e-22   39.69%
Drosophila        CG8483-PA           3e-23   34.55%
EBI UniRef50      UniRef50_Q299S2     1e-21   34.55%   GA21107 n=2 Tax=pseudoobscura subgroup RepID=Q299S2_DROPS
NCBI RefSeq       XP_001953066.1      1e-22   35.08%   GF17401 [Drosophila ananassae]
NCBI nr blastp    gi|194741178        2e-21   35.08%   GF17401 [Drosophila ananassae]
NCBI nr blastx    gi|313104041        2e-21   37.06%   allergen Pol d 5 precursor [Polistes dominulus]
Group
KEGG pathway 
InterPro domain
[6-169]     IPR014044   4.1e-35   CAP domain
[7-224]     IPR001283   2.5e-32   Allergen V5/Tpx-1-related
[39-57]     IPR002413   3.8e-08   Ves allergen
Orthology group: MCL34777 Lepidoptera specific

Nucleotide sequence:

>DPOGS208888-TA
ATGATTCTTGGAAAAGCCGAAGCCAATATTATTATTAACCAAATTAATATGCGTAGAAATTTTATTGCCACAGGACGCTCCAAGTATCTACCAGCAGCAGCAAACATGAATAAAATTAAGTGGTCCGAAGAATTAGCAACTTTTGCTCAGCGTTGGGTAGACCAATGTGATCAAAGTCCGAATAAGGAAGATAGCTGTCGGGATCTTGAAAAGACAAAAGTTGGCCAAAATATAGCAACAATAGTCGGATCAACACCTGGACTGAACATTAAAAGTTTTATTGAAATGTGGTTTATGAAATCTATTTACTATAATAGCAGTGTTACTTTTTACAATCAATCTGTTGACCACAAAGCAAATTACTTCACACAATTAATTTGGGCTGAAACAGAAGAAGTGGGGTGTGGAAGAGCTAGATTTGTGATACATAATAAAAGGCCTATCCTTATTGAAAGACTTGTTTGCAATTTCGCACCAACAGGCAACGTACAAGGAAAACCGATATATATTATAGGATACCCCGCCACTCAGTGTAAAAATAGTATGAATCCTGATAAAGCGTTTATTGGCTTGTGTGAAACAAACAATACACCAAAACCCCTAACATTAAAATATAGAGATAAAATGACTACAAGAAATCCATCTACCAGCTTCCTAAGAATACTTAATTTATACAATGAATCAGCAACCCCTGAAATGGATCCTATTAAAATTTACCACAATTCTAAACCAAACGTAAAAGTTTATCATGAAAATCATAATTACTACAACAACCATAAATCAAAACTTAGAGATTTACTAAGACATCCATACGATGCTGTGAAACGCAGCTACCCTTTAGACGGTGAACACTTTTGGCGTATTCCTAATAACAATACACGAGATCACGCAGAAAATTATCAAAAAGAAAGAGGACATTCGCGAGTGTACCACGGTCACAATCATAGAAATGAATTTGATTTTTTACATCCAGAAACGTCCTCAATAGATTCTAGGAGGTTTGATATGACCACTTACACCGTAAATATGTTTTCTAATCAACAATTTAGAAAACACAATAGTGGCAACCGTTGTACAAGAAAAGGGGAAACTCAAACACAAACTGTTACAGAATGCACACCATGTGCTCAAACTGCTAAATGTACAAGACGTCAGATAAATAATTATAAAATATCGAATAATAATTGTAAACAATACCAATTTCTGACCGCATCTGACTCTATCCCACATGAGTCATGTCCGTGTAACACCTTTCAAAATTCATTTAAAACGAGATTTATGTCTTGTGAGCACAATCGCAAAACTAATTGTGGTTGTGCAGATGAAAATTGGTCAACTGAACCCTGTCGAAACATGTTACGAACTTTGAAGGACATTTCTGAAGACAGCAATAGCAATAAAGATTCATTTTATGATGACTTTCATCCTAATTTTCACTCTAATAATCCAAAAATTGTATCAAAATCACATCATTATGAGAATAGGAAGAGAAGTGTACCAGAAGAAGAAATTAATTTTAAACCATTTTGGGAAGTCGATGAATATTCTAGTAATAAACAACCACAGTTGAAATCTTTAAGATTTACAACATTATCCAACAAGAAAATTAAATCCAAAATACTTAAAAAAAATACCAGAAACTCAGCTAAAACTGAGTCTATCACGATTCCATTCGAAACACAACCGGCGTTATCAAATAAACGGGTTACAGAAAAGTATTTGTCATTTGACGAGTTATTACATCTCCGGAAATATAATGCAGAACTTAACGCACGAAGAGCTAATGAAGAAAGGGCTAGCTTTTCCAATGACGGTATTCAAATACTCAGAGAAGGTACAACTAAAGCAACTACTAAAACTGCTGCAACTACTACGACTACTGCTACTACAGCAGGAAGTCCTTCTGAATACACTGCTAATACCCCATATATTCGAATGAAACATTGCACACGTAAATTGACTTGCACGTGGACTGCCGCTTCTATGACTGACAGTAATGG
GAGTATCATAACTGGAGGCGCCGATAACATTGGATCTAGAACACCGCCTGGCTACGTTGAAGGGTGTACCAGAACTTCTACTTGTACCAGAGACTACATGAATCGTAACAAAATGGCAACATTGCCCGTTGATATTACTAGCGTAGAAACTGATAATGGTGACGATGAAGATTATTGTGAACGTCGGTCTTTAAACAAACGAAATATAAATAAAAATAAAATAATTCAGAGACTTACAAAACGGACATCCATCTCGTACAATATTACTAAATCGCCAAGGAGTATTAACAGTAGAATAAACAGAACACCAAAAAACTTATTAAGTAACCGCAAAATAAAAACAAAAAATAGAAGTAAAAGATTTACAACCAGCCGGAACAATACACGAAATTCAAATATTCATAAAGTCAAAAGAGAAAATAAGATTCAGGCTGAGAATAACTTATTATCATATGGTGATATTTACTATCTTGTAACTAAAAAAATACTTAAAATGTGGAAAAAGAAAAGTATCCAACATAACCAATTTTGCTTTTGTAATCATGTCTCAAAATTAAAAAGTGATTATTATAACATCGCTTTGTCCTTCGTCATCTTGACCCAATCACCGTTAATTTTCATGAAATTAATGTCTCTCTTACTTTTCCTTGATTTATCGGTGTTTTTAAATACTTCACTCTCTTCATTTTCTGTATTATTATTGAGTAATAATTTCTGA

Protein sequence:

>DPOGS208888-PA
MILGKAEANIIINQINMRRNFIATGRSKYLPAAANMNKIKWSEELATFAQRWVDQCDQSPNKEDSCRDLEKTKVGQNIATIVGSTPGLNIKSFIEMWFMKSIYYNSSVTFYNQSVDHKANYFTQLIWAETEEVGCGRARFVIHNKRPILIERLVCNFAPTGNVQGKPIYIIGYPATQCKNSMNPDKAFIGLCETNNTPKPLTLKYRDKMTTRNPSTSFLRILNLYNESATPEMDPIKIYHNSKPNVKVYHENHNYYNNHKSKLRDLLRHPYDAVKRSYPLDGEHFWRIPNNNTRDHAENYQKERGHSRVYHGHNHRNEFDFLHPETSSIDSRRFDMTTYTVNMFSNQQFRKHNSGNRCTRKGETQTQTVTECTPCAQTAKCTRRQINNYKISNNNCKQYQFLTASDSIPHESCPCNTFQNSFKTRFMSCEHNRKTNCGCADENWSTEPCRNMLRTLKDISEDSNSNKDSFYDDFHPNFHSNNPKIVSKSHHYENRKRSVPEEEINFKPFWEVDEYSSNKQPQLKSLRFTTLSNKKIKSKILKKNTRNSAKTESITIPFETQPALSNKRVTEKYLSFDELLHLRKYNAELNARRANEERASFSNDGIQILREGTTKATTKTAATTTTTATTAGSPSEYTANTPYIRMKHCTRKLTCTWTAASMTDSNGSIITGGADNIGSRTPPGYVEGCTRTSTCTRDYMNRNKMATLPVDITSVETDNGDDEDYCERRSLNKRNINKNKIIQRLTKRTSISYNITKSPRSINSRINRTPKNLLSNRKIKTKNRSKRFTTSRNNTRNSNIHKVKRENKIQAENNLLSYGDIYYLVTKKILKMWKKKSIQHNQFCFCNHVSKLKSDYYNIALSFVILTQSPLIFMKLMSLLLFLDLSVFLNTSLSSFSVLLLSNNF-
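
The protein record above appears to be the conceptual translation of the DPOGS208888-TA transcript, with the trailing "-" standing in for the stop codon. The following is a minimal sketch for checking that relationship, assuming the standard genetic code and a transcript that starts in frame at its first base. The names CODON_TABLE and translate are illustrative and do not come from the database.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
# Assumption: the record uses the standard nuclear code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence from its first base up to the first stop.

    Whitespace and line breaks inside the sequence are ignored, so a FASTA
    body wrapped over several lines can be passed in directly.
    Codons containing non-ACGT characters are rendered as 'X'.
    """
    cds = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":
            # Mirror the record's convention of ending the protein with "-".
            protein.append(stop_symbol)
            break
        protein.append(aa)
    return "".join(protein)


# First eight codons of DPOGS208888-TA, copied from the record above.
print(translate("ATGATTCTTGGAAAAGCCGAAGCC"))
```

Passing the full nucleotide sequence (both wrapped lines) to translate and comparing the result with the DPOGS208888-PA string is one way to confirm that the two records agree.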