Monarch geneset OGS2.0

DPOGS201998
Transcript        DPOGS201998-TA  4074 bp
Protein           DPOGS201998-PA  1357 aa
Genomic position  DPSCF300060 + 236720-252362
RNAseq coverage   699x (Rank: top 18%)
Annotation
Heliconius      HMEL002633        0.0    68.65%
Bombyx          BGIBMGA010406-TA  0.0    54.22%
Drosophila      AGO2-PC           7e-91  38.29%
EBI UniRef50    UniRef50_Q59HV7   0.0    53.66%  Argonaute 2 n=2 Tax=Obtectomera RepID=Q59HV7_BOMMO
NCBI RefSeq     NP_001036995.2    0.0    53.66%  argonaute 2 [Bombyx mori]
NCBI nr blastp  gi|166706854      0.0    53.66%  argonaute 2 [Bombyx mori]
NCBI nr blastx  gi|166706854      0.0    53.50%  argonaute 2 [Bombyx mori]
Group
Gene Ontology  GO:0005515  1.7e-96  protein binding
               GO:0003676  6.9e-80  nucleic acid binding
KEGG pathway 
InterPro domain  [1027-1326]  IPR003165  1.7e-96  Stem cell self-renewal protein Piwi
                 [1011-1331]  IPR012337  6.9e-80  Ribonuclease H-like
                 [491-659]    IPR003100  1.1e-38  Argonaute/Dicer protein, PAZ
                 [446-495]    IPR014811  4.3e-11  Domain of unknown function DUF1785
Orthology group  MCL10363  Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS201998-TA
ATGGGACGAGGAGGAAAAAAGAAAAATGCACAAGAAGTAGAAAAAAAATTGAGTGGAGAATCATCTCAATCACAACCAAGTACATCCATTGCAACAGTTGCTGTGGAACAAAAACAAGAAGTTGGCACTGAACAAGTTACACCAACACTCGAAGAAAAAGCTGTATCAGCTCAAGCTGAGCAACCAAAAGATGATTCCAAAGAAGAAACCATTGGGTTAGGCCTCGATTTGGGTGAAGGAAAAAAAAGGCGTCCTAGGAAAAAGAAATCTACGACTGTAGCACAAATGCTGTCTGAAGATATTGCACAAAAAAATATCTCATTACAGCAGGAAACGCAATCCCATATAACACCTACGGCGGTAACAACGCAGCCGTCTACGTCTCCGGCGTTGGGGAGCACAAGTCAATTTCCAGGACTCGGTAGAGGTAGGGGGAGGGGCAGAGGTTGGGGTGTTCCTCAGCAGACTCCTCTGCAATCAAAAGGGCTACAGTTTGCTGAGCAACAATATCCCGGCCAACAGCAATATGATGCTCAATCATACCCAGGACAGCAACACCCTGGCCAGGAGCAATATGGAGCTCAATCACACCCAGGACAGCAACACCCTGGCCAGCAGCAATATGGAGCTCAATCACACCCAGGACAGCAACACCCCCCTGGCCAGCAGCAATATGGAGTTCAATCACACCCAGGACAGCAACACCCTGGCCAGCAGCAATATGGAGCTCAATCACACCCAGGACAGCAACACCCTGGCCAGCAGCAATATGGAGCTCAATCACGACCTAGACAGCAACAATCTTCACAAATGCAACAAAGCAGGTCATCGAGTATTGCTTCTATAGCATCATCTTCGAGCAGCGGTGCATTGAGCCGGTATAAAATACCGGCAAGAATAGAAGGCCGCTCGGTCAGGAGTACAAATATTAAAGTTTTGACCAATTACTTGGAAATGAAAATTAAGGATGTCAAAGTTCATCGCTACGATATTTCGGTTAAACCCGACAAGCCCAAGAAAATTGCACAGAAGGCTTTTGCTGTAGTGAAACAACAATATTTTCCTAAATCTGTAATTGCGTTCGACCAAAGGAAGAACTGTTACACTTTAGCACCTCTCTGGAAAGGTCCTGATGAAAGAGTCAGTTATGATGTGGAGGTGATTGACGATAATAAAGTTAAAATGCCCTTCGAGGTGTCATTGAAGGTTACTGGCGTAATTGATCTCGGGAGGATCTTAAATCACATCAAAACTGGAGACTCCTCTTTGAATACGCCCAGCGAAGCTATACAGGCCATTGATGTTATTTTACAACAGGGCACACTTGAAAATTATGTTAAGGTCGGAAGACAATTCTTCAAACGTCCACAGAATCCAATAGATTTAGGTTTTGGATTGGAAATGTGGACGGGTCTATTCCAGTCGGCCATATTCACTTGGCGGCCGTTCATAAATATCGACGTGGCCCACAAAGGTTTTCCGCGGACTCAATCTGTACTGCAGGCTTACATGAATGATTTTAAATTGAATCCAAATCAATGCCTTGAACAACAAAGGGGATATAATGTCGAATTATTCAGGCAATATTTAAAGGGACTTAAGGTGAAGGCGTTTGTAGGAGGCGACAGTAGTGGCAAAGTACGTGAATACATTGTCAATGATGTACAGGATCCTCCGAGTAAATTAACATTTGAAATGACGGATGCGAGTGGAAAAACCAAGAGAATGTCGGTGGCTAACTATTTCCTTACTGAGAAAAAATTAAGACTGCAATATCCAAACCTGAATTGTTTGTGGGTCGGTCCGAAAAACAAGAACATCTTCTACCCAATGGAATTACTTCAAGTCTCTTATGGCCAGGCATTAAACAAGCAACTGAATGAAAAACAGCTACAGACTATGGTACGCGAGGCTGCGACTCCGCCGAATGAGAGATTGAAGAAAATTAAGGAAGTTATACACAATATGAATTATCCAGCAAATCCAATATTCAAGCATTT
TAAATTGGAGATTGAGAAGGACTTTTTCAAAGTCGATGCAAAAATATTGCAGGCGCCAAAGTTAGATGTTGGTGCTGGAAGAGGGGTCGTGCCCAGGAATGGATCGTGGCAGGCTAATCGCTTCTTAAAACCCAGCGCATTGGAATCCTGGGGACTCATAGTAGTTGATGCAGATCCAAATGCTTGTAGATGCGAAGAGTTCATGCAAAATATAAACCGTATCGGGAAGCAGATGGGGATGACTGTGAACGCACCCAAGTATTACAACTTCAATGTTAAGCCTTTTGATTTGAAGAGAACTTTGTATGCTGCATCTGAAAAGGGTGTCAATTTCTTATTCATCATTGTAGGCGGAAGAGACAAGAATTGCTATCATAAAGTTAAACAAATAGCGGAGTTGGAGGTCGGATTACTAACGCAATGCATAAAAGAGTTCACAGCAAAAGCTAAGATGAGCGATCAAACCATACGTAATATACTTTTAAAGGTTAATTCAAAACTGATGGGTGTGAACCAAGCATTGGACGCGTCATCCATACCGAGGTGCATCAGTGAAGGCGGTGTTATGTTGGTTGGCGCTGACGTCACTCACCCCTCCCCTGATCAGGCATTAAACAAGCAACTGAATGAAAAACAGCTACAGACTATGGTACGCGAGGCTGCGACTCCGCCGAATGAGAGATTGAAGAAAATTAAGGAAGTTATACACAATATGAATTATCCAGCAAATCCAATATTCAAGCATTTTAAATTGGAGATTGAGAAGGACTTTTTCAAAGTCGATGCAAAAATATTGCAGGCGCCAAAGTTAGATGTTGGTGCTGGAAGAGGGGTCGTGCCCAGGAATGGATCGTGGCAGGCTAATCGCTTCTTAAAACCCAGCGCATTGGAATCCTGGGGACTCATAGTAGTTGATGCAGATCCAAATGCTTGTAGATGCGAAGAGTTCATGCAAAATATAAACCGTATCGGGAAGCAGATGGGGATGACTGTGAACGCACCCAAGTATTACAACTTCAATGTTAAGCCTTTTGATTTGAAGAGAACTTTGTATGCTGCATCTGAAAAGGGTGTCAATTTCTTATTCATCATTGTAGGCGGAAGAGACAAGAATTGCTATCATAAAGTTAAACAAATAGCGGAGTTGGAGGTCGGATTACTAACGCAATGCATAAAAGAGTTCACAGCAAAAGCTAAGATGAGCGATCAAACCATACGTAATATACTTTTAAAGGTTAATTCAAAACTGATGGGCGTGAACCAAGCATTGGACGCGTCATCCATACCGAGGTGCATCAGTGAAGGCGGTGTTATGTTGGTTGGCGCTGACGTCACTCACCCCTCCCCTGATCAGACCTCAGTTCCCAGTATTGCAGCCGTCACAGCATCTTTTGACAATTTCTGTTTCAAGTATAACATCGAACTCAGCGTACAGACACCAAAGGCTGAAATTATTGTGGAATTCGAAGACATGATCTTTGATCACTTGCAAGTCTATAGAAAGAATCAAAGATGCCTTCCAAGGAAGATATATGTGTTCCGAGATGGCGTGTCGGAGGGACAGTTTGCTCAGGTTATGAACAGTGAATTGGTTGCTATTGAAAAGGCCTACCGTCGCCTGGACCAAGAAAGGAAACCGGAAATTCTCTTCCTGTTGGTACAGAAGAGGCATCATACGAGGTTCTTTTTGGGAGAGCATGATCGTCAGAACGTTGAACCTGGTACAGTGGTCGACACAGAAATCGTTCATCGCAGTGAACTTGACTTTTATTTGGTGTCCCATTCAGCTCTCAAAGGTACAGCTCGTCCTACGAGATACCACGCGGTCGCTAACGACGGTGGTCTTCCTTCTGACGAAGTCGAACAGCTGACATATTACTTGTGTCACTTGTACTCCAGATGCATGAGATCAGTTTCATACCCAACGCCCACATACTACGCTCACCTAGCGTGTCTCAGAGCTCGATCCCTTACATTCGGCGATCACTTCAACAACAGCG
ATCTGGAAAAGGAACCGAAAAGACTTCGCGTCATGGACAAAATGCTAAATTGCAGTCGCATGTTTTTCGTGTAG

Protein sequence:

>DPOGS201998-PA
MGRGGKKKNAQEVEKKLSGESSQSQPSTSIATVAVEQKQEVGTEQVTPTLEEKAVSAQAEQPKDDSKEETIGLGLDLGEGKKRRPRKKKSTTVAQMLSEDIAQKNISLQQETQSHITPTAVTTQPSTSPALGSTSQFPGLGRGRGRGRGWGVPQQTPLQSKGLQFAEQQYPGQQQYDAQSYPGQQHPGQEQYGAQSHPGQQHPGQQQYGAQSHPGQQHPPGQQQYGVQSHPGQQHPGQQQYGAQSHPGQQHPGQQQYGAQSRPRQQQSSQMQQSRSSSIASIASSSSSGALSRYKIPARIEGRSVRSTNIKVLTNYLEMKIKDVKVHRYDISVKPDKPKKIAQKAFAVVKQQYFPKSVIAFDQRKNCYTLAPLWKGPDERVSYDVEVIDDNKVKMPFEVSLKVTGVIDLGRILNHIKTGDSSLNTPSEAIQAIDVILQQGTLENYVKVGRQFFKRPQNPIDLGFGLEMWTGLFQSAIFTWRPFINIDVAHKGFPRTQSVLQAYMNDFKLNPNQCLEQQRGYNVELFRQYLKGLKVKAFVGGDSSGKVREYIVNDVQDPPSKLTFEMTDASGKTKRMSVANYFLTEKKLRLQYPNLNCLWVGPKNKNIFYPMELLQVSYGQALNKQLNEKQLQTMVREAATPPNERLKKIKEVIHNMNYPANPIFKHFKLEIEKDFFKVDAKILQAPKLDVGAGRGVVPRNGSWQANRFLKPSALESWGLIVVDADPNACRCEEFMQNINRIGKQMGMTVNAPKYYNFNVKPFDLKRTLYAASEKGVNFLFIIVGGRDKNCYHKVKQIAELEVGLLTQCIKEFTAKAKMSDQTIRNILLKVNSKLMGVNQALDASSIPRCISEGGVMLVGADVTHPSPDQALNKQLNEKQLQTMVREAATPPNERLKKIKEVIHNMNYPANPIFKHFKLEIEKDFFKVDAKILQAPKLDVGAGRGVVPRNGSWQANRFLKPSALESWGLIVVDADPNACRCEEFMQNINRIGKQMGMTVNAPKYYNFNVKPFDLKRTLYAASEKGVNFLFIIVGGRDKNCYHKVKQIAELEVGLLTQCIKEFTAKAKMSDQTIRNILLKVNSKLMGVNQALDASSIPRCISEGGVMLVGADVTHPSPDQTSVPSIAAVTASFDNFCFKYNIELSVQTPKAEIIVEFEDMIFDHLQVYRKNQRCLPRKIYVFRDGVSEGQFAQVMNSELVAIEKAYRRLDQERKPEILFLLVQKRHHTRFFLGEHDRQNVEPGTVVDTEIVHRSELDFYLVSHSALKGTARPTRYHAVANDGGLPSDEVEQLTYYLCHLYSRCMRSVSYPTPTYYAHLACLRARSLTFGDHFNNSDLEKEPKRLRVMDKMLNCSRMFFV-
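
The figures stated in this record can be cross-checked against each other. The sketch below is illustrative only and uses just the numbers listed above (transcript length, protein length, InterPro domain coordinates). It assumes the transcript is entirely coding sequence with no UTR, and that the trailing "-" in the protein sequence marks a single stop codon. The function names are hypothetical and not part of any database tooling.

```python
# Consistency checks on the stated record metadata (not on the raw sequences).
# Assumptions: the transcript is CDS-only, and one stop codon terminates it.

TRANSCRIPT_BP = 4074   # stated transcript length
PROTEIN_AA = 1357      # stated protein length

# InterPro domain coordinates (start, end) in protein residues, as listed above.
INTERPRO = {
    "IPR003165": (1027, 1326),  # Stem cell self-renewal protein Piwi
    "IPR012337": (1011, 1331),  # Ribonuclease H-like
    "IPR003100": (491, 659),    # Argonaute/Dicer protein, PAZ
    "IPR014811": (446, 495),    # Domain of unknown function DUF1785
}


def cds_length_matches(transcript_bp, protein_aa, has_stop=True):
    """True if a CDS of transcript_bp bases encodes protein_aa residues."""
    if transcript_bp % 3 != 0:
        return False
    codons = transcript_bp // 3
    return codons == protein_aa + (1 if has_stop else 0)


def domains_within(protein_aa, domains):
    """True if every domain interval lies inside residues 1..protein_aa."""
    return all(1 <= start <= end <= protein_aa
               for start, end in domains.values())


print(cds_length_matches(TRANSCRIPT_BP, PROTEIN_AA))
print(domains_within(PROTEIN_AA, INTERPRO))
```

Under these assumptions, 4074 bp is 1358 codons, which is 1357 residues plus one stop codon, and all four domain intervals fall within the 1357-residue protein.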