Monarch geneset OGS2.0

DPOGS208430
TranscriptDPOGS208430-TA2796 bp
ProteinDPOGS208430-PA931 aa
Genomic positionDPSCF300095 - 236140-245320
RNAseq coverage144x (Rank: top 54%)
Annotation
HeliconiusHMEL0072150.071.59% 
BombyxBGIBMGA009037-TA0.077.11% 
DrosophilaAGO3-PD0.044.11% 
EBI UniRef50UniRef50_A9ZSZ20.069.39%AGO3 n=3 Tax=Obtectomera RepID=A9ZSZ2_BOMMO
NCBI RefSeqNP_001098067.20.069.49%argonaute 3 [Bombyx mori]
NCBI nr blastpgi|1646055050.069.39%AGO3 [Bombyx mori]
NCBI nr blastxgi|1646055050.069.49%AGO3 [Bombyx mori]
Group
Gene OntologyGO:00036761.4e-155nucleic acid binding
GO:00055152.6e-121protein binding
KEGG pathwayaag:AaeL_AAEL0078230.0 
 K02156 (AUB, PIWI)maps-> Dorso-ventral axis formation
InterPro domain[478-931] IPR0123371.4e-155Ribonuclease H-like
[625-917] IPR0031652.6e-121Stem cell self-renewal protein Piwi
[192-500] IPR0031003.6e-52Argonaute/Dicer protein, PAZ
Orthology groupMCL10552 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS208430-TA
ATGGCTGATGCTGGGAGGGGTCGCGGTCGGGGCCTTGCCTTGCTTCAGGCGCTTAAAGCACGAATGACTGAGTCACCAGCCTCTCAGGAACCAAGTCAAGACCCCTCTGCACCACCAAGTGTTGCTGCAACACCAAATGTTGCTGTAAAACCCAATTTGGTGCCTGACACAGCATCCAGTGCAGCCCCAAGTAATGTTAGTTCAACAAGTACTATACCTGGGGGAAGAGGGAAAATGGCAGCAATGCTGCTGTCCAAAATCCAGAAACCTGGAGTTGATAAGCCTATGTTCAGTCCAGCATCTGATGTGAGTCCAAGCCCGAGTATGGTGGGACAAGGGGTTGGACGTGGACTACAGCTTCTGCAGAATTTGAGAAAGCAAACTTCTGCAAGTTCAACCGTAGACTCGAGTGTAGAAAATATAACAAAAGGTCTTGCAGCTTCCTCTGTATCTTCAGCTAATTTACCCGGAGGGAAGAACAAATATTATAGTGAAATCAGTCAAACAGAGCCAGTGGTTATGAAGGGAGAGAGCGGTACTCCGTGTGACCTCACAGCCAACTTTATATATCTTAAATATAAAGACAACAGTGTGTTCGAATACGAAGTGAGATATGAACCAGATCAGGATTACAAGCATTTGAGATTTAAACTTCTAAATGAACATAATCACTTCTTCCAACAGAAGGCATTTGACGGCACAACTCTTTACGTACCTCACAAATTGCCAGACGAAGCTCTCAATTTAGTTTCAACCAATCCTTATGATGATAGTAAAGTTAATATAACTATTTTATATCGTCGCCCTCGTCTCCTCCGTGAAATGATACATATATATAATATGCTGTTTAAACATATAATGAGAGATCTGAATCTTGTGAGGTTCGGACGACAGCACTATAATGAGAATGCTGCAATACAAATACCACAATATAAACTTGAAGTATGTCCAGGTTATGTGACAGCGGTTGATGAGTATGAAGGTGGTCTCATGTTGACTCTAGACTCGACTCATCGTGTGCTTCGTACACAGACTGTGTTGTCACTGATCAAGGAGACCGTTCAGACTCAAGGGGCTGCATGGAAGAGATATATTAGTGATGCACTCATCGGAACATCAGTTATGACCACATATAATAAAAAACTGTTTCGAGTTGACAGCATAGATGACACGATAAACCCGAGATCAACGTTTGAGAAGAATGAAAAGGGAAAAATGGTTAAAATTACGTATCTTGAGTATTACAAGAATAAAGGTATTGATATTATGGATATGGACCAGCCAATGTTGATATCGAGAGATTCCAAGCGTATGCCCGGCTCAGAGGAGATTACTGATTTCATGATATGCCTCGTACCGGAACTGTGCCAACTGACTGGCTTGTCTGACAGTCAGAGAAGTAACTTCAAACTAATGAAAGATGTCGCCACTTACACCAGAATAACACCGAATCAACGGCATGCTGCTTTTAAAAAGTACATCCAAAATGTTCTAGAAAACGAGTCCGCCTTGAATCGCCTAAAGGGCTGGGGTCTCACAATAGCCCCTGAGACGGTTGAGATAGCTGGCCGTACTTTGGCACCAGAGACTTTATATTTTGGCAATAACGTCAAGGTACCGGGACAGGCGAATTCCGAATGGAACGGTGACGTCGGAAGGAACGGTGTGATGCAAGCCGTGGATATATTAAGATGGGTTGTACTGTTCACTGATCGGGATAAACAGGTGGCTTCGGATTTCGTAGAAACTCTAAAGCGCTGCAGCCGTCCGATGGGTATAAACGTGTCTAACCCGGATATGGTCCGACTCCCAAACGATCGGACCGATACTTATGTGATGGCACTCAAGAAATGTATCAGCAGTCAGTTACAAGTGGTGGTAGCCATCTGTCCTACCATCAGAGACGATAGGTACGCTGCCATCAAGAAGATATGCTGCGCTGAGAACCCGGTACCATCTCAGGTGATCAACGCACGTACGATAATGAACAATCAGAAAATAAGATCGGTTACACAGAAGATATTACTGCAAATTAATTGTAAGCTGGGAGGTACCTTGTGGCATATCAGTATACCGTTCAAATCGGCTATGGTTGTCGGTATAGATTCATACCATGACGCCAGCAGGAAGAAACGTAGCGTGTGTGCATTTGTGGCTTCATATAACCAGTCAATGACGCACTGGTATTCCAAGGCGGTATTCCAAGAGAGGGGTCAAGAAATAGTGGACAGTCTCAAATCCTGTTTAGTCGACGCCCTCAAACATTATCTAAGAATAAATGGAAGATTACCCGATAGGATTATAATATATAGAGACGGCGTGGGCGATGGTCAGTTGAAACTTCTCAAAGAGTATGAGATACCTCAAATGGAAGTAAGTTTTTCGGTTGTCGATGACACCTACAAGCCCACTTTAACCTACATCGTCGTACAGAAACGTATTAATACGCGAATATTTTTGAAGTCTAGAGAAGGTTACAGTAATCCCGCACCAGGAACAATAGTTGATTATAAGATAACTAGACGGGACTGGTACGACTTCTTAATAGTATCACAGAAGGTGAACCAGGGTACCGTGACCCCTACTCACTATGTGGTCGTTAGTGACAACAGTCCAATGTCCCCGGACCAGTGTCAGAGGTTAACATACAAATTGTGTCATCTGTATTACAATTGGCCCGGCACTGTTCGTGTGCCAGCGCCTTGTCAATACGCACACAAGCTGGCATCGCTTGTAGGGCAGAATATACACCAGCAGCCGTCTGAAGCGTTATCTGATAAGCTATTTTTCTTATAA

Protein sequence:

>DPOGS208430-PA
MADAGRGRGRGLALLQALKARMTESPASQEPSQDPSAPPSVAATPNVAVKPNLVPDTASSAAPSNVSSTSTIPGGRGKMAAMLLSKIQKPGVDKPMFSPASDVSPSPSMVGQGVGRGLQLLQNLRKQTSASSTVDSSVENITKGLAASSVSSANLPGGKNKYYSEISQTEPVVMKGESGTPCDLTANFIYLKYKDNSVFEYEVRYEPDQDYKHLRFKLLNEHNHFFQQKAFDGTTLYVPHKLPDEALNLVSTNPYDDSKVNITILYRRPRLLREMIHIYNMLFKHIMRDLNLVRFGRQHYNENAAIQIPQYKLEVCPGYVTAVDEYEGGLMLTLDSTHRVLRTQTVLSLIKETVQTQGAAWKRYISDALIGTSVMTTYNKKLFRVDSIDDTINPRSTFEKNEKGKMVKITYLEYYKNKGIDIMDMDQPMLISRDSKRMPGSEEITDFMICLVPELCQLTGLSDSQRSNFKLMKDVATYTRITPNQRHAAFKKYIQNVLENESALNRLKGWGLTIAPETVEIAGRTLAPETLYFGNNVKVPGQANSEWNGDVGRNGVMQAVDILRWVVLFTDRDKQVASDFVETLKRCSRPMGINVSNPDMVRLPNDRTDTYVMALKKCISSQLQVVVAICPTIRDDRYAAIKKICCAENPVPSQVINARTIMNNQKIRSVTQKILLQINCKLGGTLWHISIPFKSAMVVGIDSYHDASRKKRSVCAFVASYNQSMTHWYSKAVFQERGQEIVDSLKSCLVDALKHYLRINGRLPDRIIIYRDGVGDGQLKLLKEYEIPQMEVSFSVVDDTYKPTLTYIVVQKRINTRIFLKSREGYSNPAPGTIVDYKITRRDWYDFLIVSQKVNQGTVTPTHYVVVSDNSPMSPDQCQRLTYKLCHLYYNWPGTVRVPAPCQYAHKLASLVGQNIHQQPSEALSDKLFFL-