Monarch geneset OGS2.0

DPOGS200325
TranscriptDPOGS200325-TA2079 bp
ProteinDPOGS200325-PA692 aa
Genomic positionDPSCF300026 + 215604-223216
RNAseq coverage241x (Rank: top 43%)
Annotation
HeliconiusHMEL0151022e-9253.19% 
BombyxBGIBMGA005621-TA1e-3149.41% 
Drosophilakrimp-PA9e-1125.29% 
EBI UniRef50UniRef50_Q7PR141e-2830.03%AGAP002475-PA n=1 Tax=Anopheles gambiae RepID=Q7PR14_ANOGA
NCBI RefSeqXP_001660530.12e-2429.07%hypothetical protein AaeL_AAEL009987 [Aedes aegypti]
NCBI nr blastpgi|3479679424e-2830.03%AGAP002475-PA [Anopheles gambiae str. PEST]
NCBI nr blastxgi|3479679422e-2929.85%AGAP002475-PA [Anopheles gambiae str. PEST]
Group
Gene OntologyGO:00036769.5e-09nucleic acid binding
KEGG pathway 
InterPro domain[515-612] IPR0081911.7e-17Maternal tudor protein
[541-599] IPR0029999.5e-09Tudor domain
Orthology groupMCL21910 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS200325-TA
ATGAGTTTTGTAGACCTAGCGACTTTAGCAATGCACCGCCGGCAGGAATTGGATGAAAGAATAAAAGTTTTTCTAGATAAATGCGATCTCGCCATCGATGGAATAGAAAATCTTAAAACACAGGCCAAAAGCTTCGTCGATCACTTAAATTGTGAACCGTCAGAACTGAAAGACCGATTTCTCAATTGTATGAAGAAATTAGATGAATATAATTATCTCATAACAGATTTTAATTTAGCTGTGAGTAAAATTGATTCAAACAGCACTCCAAGCAGCACTCCCGTTCAAGACATCTCAACTAGTTTAATGGTACCAGAATCTGTTAGTTATAACCTCCTTGATGCAATCAGTGAACCGAAACCCTCAACCTCGAAAGCTCCACAGAAACATAAAAAGAAGAGAGTGCCAAGGGAGGATCAACTTCTCATAACATTTTCAAGCAATTCCTGTTCATCGGAGAGTGTTCCAATAACAAAATCTGAAAAGAATATCGAATCTGATATTGAAGAAAATCTTAAAAAGATCTCCTTAGATGATTCTTGCAGTGAATGCAACCTGCCAGCCCAGAGCATCTTGCAGGTGGACAACATTTACCCAGCTGTTATAGTCCATGTAGACGGCATATCATTCTGGGTTATAACAGACAACCTGACGGAGGTCTGTGATTTGATGACAGAAATGACAAGTTACTATAAAGAGAATCCGGTGCAGCTAAATCTCGACCAAGTAAGGGAGTTAACCTACTGCGCATATTATGACAACGAAGACAATGAGTGCTTCTATAGAGGACTGTTCATTAGACTGAGTGAGGACGACATGAGTATGGCGGAGGTGTTCCTGGTGGACACAGGTGAGACTCGCCGGGTTCAGTCTTCATCGCTTCAGCCTCTGCTGCCGCAGTTTGCTAGCACACCACCACATGCACGGTGCTGCCATCTAGCGGGGCCAGTTTCAGATGCACTCGATAGCTATGAAGATCTGATAGAGGGAAATCTGAAGAAGTACATCGGGAAGACATGCAAGATTAAGGTTGATGATAACACATCGGAATCACTGGGGGTGTATGTGATCGTTAAGTCCGATACATCACCAGTCCATGAGATTCTCAACGACATGATCCTCAAGGAGAGCTTGGCTTTAACAGATGATAAAACCTCTCGAGGATGTGGGTCGGAGGTCGATGGGTTTGATGAGTCTGACTTCGATATAGCAAACTGTCCCGAATACGAGGACCCTCTCGAAGCTGTTACGGGTTATCGCAACCGTGATGAAATAGACATCTGCAAGCACTACAAGGGAGGAGCGGACAGGACCTGCTTTAAGGGTTCCAGGTGTACGAAGAAACATGTAGTCAAACATCCAGATGGCTGGACGCTAGATCAGGTGCCAGTGGTGGCCAAGTACCGGCCGCTGCCGCTCCCCGCCCCTGACGCGTGGCTGAAGGTCAAGGTCACGCACGTCGCGCACTTTGACCGCTTCTACGTACACATTGTGGACGAGAAACAGGTCAAGTGTCCAGGTCCTCCCAGTTTCGGCGTTGTGCTGCCTCCGAGGAGCCTGGAGGAGCTGGTCACTGACATGAACAGTAACGCCGCCCGCATGTCCTACAAACAACTCAAGATAGTGCCAGCGCCTGGCGAGCTGGTGGCGGCGTTGTACCTGGACGGTATGTGGTACCGAGCGAGAGTGGTGTCTTCCACACGAGCTGACCAAAATGTAGAGGTGATGTACATAGACTACGGTAACGTAGTGTGGGTGAAGGAGGACGCGATCCGTGTGCTGGAGCCTCGTTACTGGGCGCTAGAGGCGCAGGCCTGTCGCTGCGCCCTGGCCGGGGTGCTGTCCACCACCAGCGACTCCCGACACTGGGCGGCCGCTAGGAATCAGCTCACCACACTCATCAACGACAGGACCCTACGGGCTCATGTCATAGCTCGGGATTACGACGAAATAACAGTGGAACTGTTTGATGATAAGGATCAGAGCATCGGCGAGCTCATGGCGGCCGGAGGTTACGTGAAGCTCGAACACTATGACGTCATATCCGACACCGGCCGCACGCAGATCGTTGTACCTTAA

Protein sequence:

>DPOGS200325-PA
MSFVDLATLAMHRRQELDERIKVFLDKCDLAIDGIENLKTQAKSFVDHLNCEPSELKDRFLNCMKKLDEYNYLITDFNLAVSKIDSNSTPSSTPVQDISTSLMVPESVSYNLLDAISEPKPSTSKAPQKHKKKRVPREDQLLITFSSNSCSSESVPITKSEKNIESDIEENLKKISLDDSCSECNLPAQSILQVDNIYPAVIVHVDGISFWVITDNLTEVCDLMTEMTSYYKENPVQLNLDQVRELTYCAYYDNEDNECFYRGLFIRLSEDDMSMAEVFLVDTGETRRVQSSSLQPLLPQFASTPPHARCCHLAGPVSDALDSYEDLIEGNLKKYIGKTCKIKVDDNTSESLGVYVIVKSDTSPVHEILNDMILKESLALTDDKTSRGCGSEVDGFDESDFDIANCPEYEDPLEAVTGYRNRDEIDICKHYKGGADRTCFKGSRCTKKHVVKHPDGWTLDQVPVVAKYRPLPLPAPDAWLKVKVTHVAHFDRFYVHIVDEKQVKCPGPPSFGVVLPPRSLEELVTDMNSNAARMSYKQLKIVPAPGELVAALYLDGMWYRARVVSSTRADQNVEVMYIDYGNVVWVKEDAIRVLEPRYWALEAQACRCALAGVLSTTSDSRHWAAARNQLTTLINDRTLRAHVIARDYDEITVELFDDKDQSIGELMAAGGYVKLEHYDVISDTGRTQIVVP-