Monarch geneset OGS2.0

DPOGS208462
TranscriptDPOGS208462-TA3066 bp
ProteinDPOGS208462-PA1021 aa
Genomic positionDPSCF300064 - 1631155-1641308
RNAseq coverage534x (Rank: top 24%)
Annotation
HeliconiusHMEL0080543e-16157.00% 
BombyxBGIBMGA010644-TA0.069.17% 
Drosophilaaub-PA1e-12336.94% 
EBI UniRef50UniRef50_A8D8P80.069.17%PIWI n=2 Tax=Bombyx mori RepID=A8D8P8_BOMMO
NCBI RefSeqNP_001098066.20.069.17%aubergine protein [Bombyx mori]
NCBI nr blastpgi|1667068560.069.17%aubergine protein [Bombyx mori]
NCBI nr blastxgi|1667068560.069.58%aubergine protein [Bombyx mori]
Group
Gene OntologyGO:00036769e-116nucleic acid binding
GO:00055155.4e-110protein binding
KEGG pathwaynvi:1001184149e-155 
 K02156 (AUB, PIWI)maps-> Dorso-ventral axis formation
InterPro domain[691-1021] IPR0123379e-116Ribonuclease H-like
[716-1007] IPR0031655.4e-110Stem cell self-renewal protein Piwi
[147-465] IPR0031008.4e-68Argonaute/Dicer protein, PAZ
Orthology groupMCL10228 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS208462-TA
ATGTCTGAGAGGGGTCGCGGACGCGCTAGAGGCCGGGCAGGTCGTGGTGGAGATGGAGCTATGCAGCCGCCTCGGAGGCCCGGAGAGCAACCTCCTCAACAAGCTGGGGCTCCCAGACCTCGGCCTCAACCGCCGTTGGCTTGGGGTCCTCCATCCGTAGTCCCGCCAGTCAGGGCTGGTATGCCCACACAGTCCGTAGGCCGAGCATCGCATCGTACCACTCCATCCACTCACGATCACCCAGGAGATGTTGATATACAACAACGAATGCAATCAGTTGCATTAGGTAGTCCATCACAGTCATCTGGTGGTGGTGATGTTGGGACTGTGGTAGGACGTGGTTCCCGTCGAGGCGGAGGAAGAGTTCTGCCTGAACAAATGACCATTGTTCGGACACGTCCAGAGACTGTGACCTCCAAGAAAGGAAGCACTGGAGCTCCGTTAGACCTCTGTGCTAACTATTTCACCATTCAGACAACTCCTCAGTGGTCACTCTATCAGTACCATGTGGATGTTGACCCCGAAGAGGACAATACTGCTGTGAGAAAAGGTCTCCTACGCATTCACGCTAAAACTTTGGGAGGGTACCTATTTGATGGAACTAACTTATACACTGTCAAAAAACTGCACCCAGACCCAATGGAACTATACTCACAGAGAACAACCGATGGTGAAAATATGAGATTGCTAATTAAGCTCACAGGTCAAGTGAGTCCTGGTGATTATCACTATATACAGATATTCAACATCATGATAAGGAAATGTTTCCGTATTCTGGATTTAAAACTCATGGGTAGAGATTTCTTTGATCCAATAGCTAAGATTGACATTCCGGAACATCGTTTACAAGTGTGGCCCGGATACAAGACTAGTATTAATCAGTACGAAGATCGTATCCTCATGGTCACTGAGATCACTCACAAAGTCCTACGTTTAGATACGGTTCTTGAAATGCTCAAGGAGTACACGTATCAGTATAAAGGTGACACATATAGGAAAATGTTCTTGGAAGATATTGTCGGCAAGATTGTTATGACGGATTACAACAAAAAGACATATAGAGTTGATGATGTTGCTTGGTCTGAGACACCGAAGTCTACTTTCAGGATGAAAGATCAAGATGTATCGTACTTGGACTACTATAATTTGAAATATAAAATCAGGATCCAAGATCCCGGCCAGCCTTTGTTGATCTCTCGTTCCAAGGAGCGTGATATAAAGCGTGGTATGCCTGAACTGGTTTACCTCGTGCCGGAGCTGTGTCGCCAAACGGGTCTCACTGATCAGATGCGAGCTAACTTTCAATTGATGCGAGCTCTGTCTACACACACCAAGATCGGTCCGGACATGCGTATACAGAAGCTACTTAATTTCAACCGCAGATTTACTCAGACTAAGGAAGTTGTTGAGGAACTGGGGACTTGGTCATTAAAACTATCGAATGACTTGGTGAGGTTCAAAGGTCGTCAATTACCAGCCGAACAAATCATCCAAGGAGGCAACATGAAGTACCCAGCTGGTGATACAAACGACGGCTGGACTAGAGATATGAGGTCTAAGAATCTGTTCTCGGTCGCTAATATGCCGTCCTGGGTAGTCATAACGCCTTCGCGCCAACAGAACGATTCACAAAAATTCGTAGATTTGATCATGAAGACCGCTTCCGGTTGCGGATTTAGAATGCCCAGACCGGAAATCGTGACCATACAACAGGACAGCCAATCCGCATACGCCAATATGTGCGAAAACGTCATAGCTAGAAAAAATCCAGCTATGATATTGTGTGTATTGGCTAGGAATTACTCGGACAGGTACATTTCAACAATAAATAATCTCGGCGGCGCTCCATGGACGGTGGAAATCCCCTTGCCTACACTGATGGTGATCGGATACGACGTGTGTCACGACACGCGTTCTAAGGAGAAGAGTTTCGGAGCTTTGGTCGCTACGTTGGACAGACAAATGACTCAGTACTACTCTTGTGTTAACGCGCACACCTCGGGAGAAGAACTCAGTTCACATATCGCCTTCAACGTAGCGTCGGCTGTACGGAAATATAGAGAGAGAAATGTGACCATACAACAGGACAGCCAGTCCGCATACGCCAATATGTGCGAAAACGTCATAGCTAGAAAAAATCCAGCTATGATATTGTGTGTATTGGCTAGGAATTACTCGGACAGATATGAAGCGATCAAGAAGAAATGCACTATCGACCGCGCGGTGCCCACGCAAGTCGTCTGCGCTAGGAACATGACAAGCAAATCGGCCATGTCCATCGCTACCAAAGTGGCCATACAAATCAACTGCAAGCTCGGCGGCGCTCCATGGACGGTGGAAATCCCCTTGCCTACACTGATGGTGATCGGATACGACGTGTGTCACGACACGCGTTCTAAGGAGAAGAGTTTCGGAGCTTTGGTCGCTACGTTGGACAGACAAATGACTCAGTACTACTCGTGTGTTAACGCGCACACCTCGGGAGAAGAACTCAGTTCACATATCGCCTTCAACGTAGCGTCGGCTGTACGGAAATATAGAGAGAGAAATGGCTTCCTGCCCGGACGTATCTTTATATACCGAGACGGCGTAGGCGACGGACAAATCGCATATGTGAAAAGCCATGAAGTAGCGGAAGTGAAAGCTAAGCTGGCTGAGATATACGGCGGCGGGGATATCAAAATGGCGTTTATCATTGTGTCTAAGCGTATCAACACGCGAGTGTTCGTGGACTGCGGCCGTAGTGGAGAGAACCCTCGCCCCGGGACCGTGGTCGATGATGTGGTCACACTACCTGAGAGATACGACTTCTATCTAGTCTCCCAAAACGTCAGAGAGGGAACGATAGCTCCGACATCATACAACATTATAGAGGACACTTCCTGCTTAGATCCGGATCGAATCCAACGCCTCACCTACAAGCTGACCCACATGTATTTCAACTGCTCGACACAAATCCGCGTGCCGTCTGTGTGTCAATACGCCCACAAGCTGGCCTTCCTAGCGGCCAACAGCCTCCACAACGCGCCCCATCACTCGTTGGCCGACACTCTGTACTTCCTATAA

Protein sequence:

>DPOGS208462-PA
MSERGRGRARGRAGRGGDGAMQPPRRPGEQPPQQAGAPRPRPQPPLAWGPPSVVPPVRAGMPTQSVGRASHRTTPSTHDHPGDVDIQQRMQSVALGSPSQSSGGGDVGTVVGRGSRRGGGRVLPEQMTIVRTRPETVTSKKGSTGAPLDLCANYFTIQTTPQWSLYQYHVDVDPEEDNTAVRKGLLRIHAKTLGGYLFDGTNLYTVKKLHPDPMELYSQRTTDGENMRLLIKLTGQVSPGDYHYIQIFNIMIRKCFRILDLKLMGRDFFDPIAKIDIPEHRLQVWPGYKTSINQYEDRILMVTEITHKVLRLDTVLEMLKEYTYQYKGDTYRKMFLEDIVGKIVMTDYNKKTYRVDDVAWSETPKSTFRMKDQDVSYLDYYNLKYKIRIQDPGQPLLISRSKERDIKRGMPELVYLVPELCRQTGLTDQMRANFQLMRALSTHTKIGPDMRIQKLLNFNRRFTQTKEVVEELGTWSLKLSNDLVRFKGRQLPAEQIIQGGNMKYPAGDTNDGWTRDMRSKNLFSVANMPSWVVITPSRQQNDSQKFVDLIMKTASGCGFRMPRPEIVTIQQDSQSAYANMCENVIARKNPAMILCVLARNYSDRYISTINNLGGAPWTVEIPLPTLMVIGYDVCHDTRSKEKSFGALVATLDRQMTQYYSCVNAHTSGEELSSHIAFNVASAVRKYRERNVTIQQDSQSAYANMCENVIARKNPAMILCVLARNYSDRYEAIKKKCTIDRAVPTQVVCARNMTSKSAMSIATKVAIQINCKLGGAPWTVEIPLPTLMVIGYDVCHDTRSKEKSFGALVATLDRQMTQYYSCVNAHTSGEELSSHIAFNVASAVRKYRERNGFLPGRIFIYRDGVGDGQIAYVKSHEVAEVKAKLAEIYGGGDIKMAFIIVSKRINTRVFVDCGRSGENPRPGTVVDDVVTLPERYDFYLVSQNVREGTIAPTSYNIIEDTSCLDPDRIQRLTYKLTHMYFNCSTQIRVPSVCQYAHKLAFLAANSLHNAPHHSLADTLYFL-