Monarch geneset OGS2.0

DPOGS206058
TranscriptDPOGS206058-TA1794 bp
ProteinDPOGS206058-PA597 aa
Genomic positionDPSCF300028 - 628468-633773
RNAseq coverage810x (Rank: top 16%)
Annotation
HeliconiusHMEL0140700.077.30% 
BombyxBGIBMGA006841-TA0.070.70% 
Drosophilayu-PC2e-7240.99% 
EBI UniRef50UniRef50_D6X2B23e-12141.78%Putative uncharacterized protein n=1 Tax=Tribolium castaneum RepID=D6X2B2_TRICA
NCBI RefSeqXP_971237.16e-12241.78%PREDICTED: similar to a kinase anchor protein [Tribolium castaneum]
NCBI nr blastpgi|910902841e-12041.78%PREDICTED: similar to a kinase anchor protein [Tribolium castaneum]
NCBI nr blastxgi|910902841e-11841.95%PREDICTED: similar to a kinase anchor protein [Tribolium castaneum]
Group
Gene OntologyGO:00036764.8e-10nucleic acid binding
GO:00037235.2e-10RNA binding
KEGG pathway 
InterPro domain[405-521] IPR0081912e-20Maternal tudor protein
[453-510] IPR0029994.8e-10Tudor domain
[303-363] IPR0181115.2e-10K Homology, type 1, subgroup
[298-368] IPR0040874e-09K Homology
Orthology groupMCL12481 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS206058-TA
ATGGCACCTTGTCGTCAGCTACTCTTATGGTCAGTTCCATCAATAGCAGTCTTACTTGGTATCTTCTGGTTCAAGAAAAAGAGAGAGTTTGCAAGATCTGACCCAGGGGGAAGAGAAAAAATAAAAAGTTTAAAAGAAGAACTTGCTGAGGCCCTTAATGCTGAAGCAGAATTATTAAAAACCTCTCCTCTAGGAAAAGTGGAGCATTCGTTAGTCAAGTCCTTACCTATTGACATAATCCCAAATGGAAGTGGTTCCCAGAGGTCTTCTCCCATAGAACTGACTGACGAAGAAGTTGATCTTGAGATAGAAAAGATTATCAAGAAAAAATCTCTTGAAATAGAGAAAAGATTGTCATTACACACTGAGAATAAGATGGAAGAAATTCGTCTGAGAGATGCTACCATTTCTAAACAAGATTGTCAATCCAAATTAGGTTCTTCTTTTAAAACTGAATTGAATTGCATTAGAAACAATCCTAACTTACAATTAGATGATATTCAAAAAATTGATGAGAATGATGACTCATCAACTGGGTCGGATACAAATTCTCAAGTAGATAGTATAAAATTAGATGCAACGGCATCTGAGAATATTGTGACTGAAGAGACCACTGAAAGTGATAATGAAGATCAAAATGATGACAATGATGATGAAACGGAAAAAACAAATGAGGCTCCCGGCAGCGATAACAGTTATTCAGCTCCACAGAATAGGCGGATTTCAGAACGAGATTCTGCGAACCACAGTCCTGTGGACCCAATGTTAGCGAGCCCATCAATGTGTCACTTCTCAGATAACCATAGTGAAGGATCAAGTGATAGTGGTAAAGGTTGTTCAGAGGCTGCCAGTCCGCCTCCAACTAATATTAACATAGTCTCTGCTGAGGCGGGTTTAAGAATACATCAGTTTATTATACCACAGACATTAGTTGGTCTATTAATAGGAAAATGTGGATCATTTGTGACTAAAATAAAGGCAAAGACTGGAGCAACAGTATATGTGAGAAGGCATCCAGATTTAGTTAAACAAAAAATTTGTGCAGTTGAAGGAACTCAGAGTGAAATAGAGGCAGCTTTAGAAATGATAAAAGAAAAATTCCCCGAAAGTAGGTTCCCCCACTTTTCAATTCAAGAAATTAGTGCAGAATTATATCAAAGACTGACTCCTGTAGTGCCAGAATTCCTTCAATTGCAGCTGGTGGAGTCGGTGAACAACGACACTATCTTGACGTGTCTGGTGAGCGCTGGTCACTTCTTCCTGCAGCAACCACTACATCCAACCTTCCCGTCGCTCCACGCGCTTCACCGCCTCATGGCCGCCACTTATCAGAGCCCGGATGTACCAGCACTGCCAAAACCGGTCCAAGAGGGTTCGATCTGTGCCGCTCCAACGGAGAACAACTGGTACAGAGCTCAGATAATATCCACATCTGAAGAGAATGACACGTCCGTTGTTAAATTGGTCGACTTTGGTGGCTACCTCACAGTTGACAATGACCAGCTTAAGCAGATCCGCTCGGACTTCATGACGCTGCCTTTCCAAGCCACCGAAGCACTGCTAGCATTTGTAAAACCCGCCAATAGCGAGAGCGAATGGAGTGGTGAGGCTCTTCGTATAATGGCTGGTCTGACGGCGGGTCAACTCCTCCACGCACAGGTCGCCGGCTACGACGAGCGAGGACTGCCGCTAGTACATCTTTATCTAACACTTAGCCCACAGCAAGTTATATTCCTGAACCGCGAGCTGGTGGACCGCGGGCTGGCGGAGTGGGACCTGCCGGCGGATTCGTGA

Protein sequence:

>DPOGS206058-PA
MAPCRQLLLWSVPSIAVLLGIFWFKKKREFARSDPGGREKIKSLKEELAEALNAEAELLKTSPLGKVEHSLVKSLPIDIIPNGSGSQRSSPIELTDEEVDLEIEKIIKKKSLEIEKRLSLHTENKMEEIRLRDATISKQDCQSKLGSSFKTELNCIRNNPNLQLDDIQKIDENDDSSTGSDTNSQVDSIKLDATASENIVTEETTESDNEDQNDDNDDETEKTNEAPGSDNSYSAPQNRRISERDSANHSPVDPMLASPSMCHFSDNHSEGSSDSGKGCSEAASPPPTNINIVSAEAGLRIHQFIIPQTLVGLLIGKCGSFVTKIKAKTGATVYVRRHPDLVKQKICAVEGTQSEIEAALEMIKEKFPESRFPHFSIQEISAELYQRLTPVVPEFLQLQLVESVNNDTILTCLVSAGHFFLQQPLHPTFPSLHALHRLMAATYQSPDVPALPKPVQEGSICAAPTENNWYRAQIISTSEENDTSVVKLVDFGGYLTVDNDQLKQIRSDFMTLPFQATEALLAFVKPANSESEWSGEALRIMAGLTAGQLLHAQVAGYDERGLPLVHLYLTLSPQQVIFLNRELVDRGLAEWDLPADS-