Monarch geneset OGS2.0

DPOGS207351
TranscriptDPOGS207351-TA1785 bp
ProteinDPOGS207351-PA594 aa
Genomic positionDPSCF300188 + 310950-313547
RNAseq coverage586x (Rank: top 22%)
Annotation
HeliconiusHMEL0088577e-9369.26% 
BombyxBGIBMGA010278-TA2e-11441.51% 
DrosophilaCG11180-PA4e-4243.42% 
EBI UniRef50UniRef50_UPI00022CA7841e-4348.09%UPI00022CA784 related cluster n=1 Tax=unknown RepID=UPI00022CA784
NCBI RefSeqNP_001037085.18e-5282.91%PIN2/TRF1-interacting protein [Bombyx mori]
NCBI nr blastpgi|1129830022e-5082.91%PIN2/TRF1-interacting protein [Bombyx mori]
NCBI nr blastxgi|1951540448e-5333.90%GL17437 [Drosophila persimilis]
Group
Gene OntologyGO:00056223.6e-17intracellular
GO:00036763.6e-17nucleic acid binding
KEGG pathway 
InterPro domain[25-71] IPR0004673.6e-17D111/G-patch
Orthology groupMCL18892 Patchy
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS207351-TA
ATGTCGATGTTAGCGGGGCCTCGTAGGAAACAAAAAATCATAAATTTAAGAGCTAAAAACAGCGCTTGGAGCAATGATACTAACAAGTTTGGTCAAAGAATGTTGGAGAAAATGGGCTGGAGTGCCGGAAAGGGGTTAGGTGCTAATGAAAACGGAATAGTTGAACATATCGTAGCCAGATACAAAAATGACGAAAAAGGTCTTGGTTATGAGGACAAAAACGATCAATGGACGAAACACGAAGACGATTTTAACTCATTACTAGCCAACCTCTCTAATAATTCGGAAAACAACTCTGAGAATTTGCATAGTGGTGTATCGTTAGAAGATAAGTCAAAGAAAAGCAAAGCACGTATACACTATCACAAGTTCACTAGAGGCAAAGATCTGTCTCGCTACAGTGAGAAGGACCTTGCAAATATATTTGGAAAGAAAACCTTTAAGCAAGAATCAGTTAAAGAGCCTGTTAAAGATGATATTAGTAATATAGAACAAAAATTTACTGAGAAAGGAAATATGGATGATTATTTTAAGAGCAAATTAGCATCACTTAAAAATAAGCCAAAGGTATTATGCAATGATAATATTGATAGAGAGGAGATAAACTATGGTTTTCAAGGATTTTCTACTGAAACAAATGATCAGAACAACGAATGTGATATCCCCAAAGTAGGATTTCAACCATTCTCTTTTTATGGAACACAATCAAACAATGACAATAAACAAGACACAGCAGATATATCAGAAGGTAATGAAGAAGTAATTAAGAGTAAGAAGAAGAAAAAATCTAAGCAGAAACATCCCTATGACGATTCAGAGAATTGCTCTGAATCACAAGAAGTTAATGGAATGTCCGAACAAAACTCTGATGAGCAGATATTGCCAGCGAAAAAAAAGAAGTCTAAAAAGGATAAAACAAAGGAAATAGAAAGCGAATGTTTTGAAAGTATCAAGATTAAAAAGAAAAAAAGGAAAAATGAACAAGAATATGTTAATGAAGATCATCAAAAAAAGGGTACTATGGAATCCAAAAAATACGTCGTCGTTAAAAACCACGTTCGAGAAATAGTAGACAATGAAAATGAGGATGTTACAGAAGAAACAACGAGCACAGACAAACAGAAGACTTCGAAGAAGTCTAAAAAAGCAAAAAAGGACCAAACTTCTACAGGCCAAGATGAAAGCGTCAGTGATATGAAAGAAAAAGATTTGCAACAGCTTTTATCGGCTTACAACAGTTGTAAAGCCATTATAACTAAAATAGAAACGAAATACGGACACTTATTAAATTTGAATAGTAATGAAGAAGGGACAGAATATGATACTTCCAGGAAAAGAAAATACAGATACAATCAATGTTACTGCAGTCCAAACAAAAGGTACAGATACGACGAAGGCGGTGATTCACAATACTACGATAATAGATATTCTAGACAGTGGGGTTATAAACAATATAATAATTATGATACGAGTTACAACAGACAATCCAATTACGATTCACCATACAACTATTATAACCGTAAATCACATGTCCAAGTTGAGGATGAAAGTAGTTACGAACTGCCGAACTCACTGTGCGAGTTAGGAGATCTGTTGAAAGATCCTTCGATACATAGGAGCTACAGATATAGAATTATACAGCAAATGAAATATATGAGACGGGAATATTCAAGCATTATGAGATATGATAAGAGATATATGATCCAGCAGTTAAAGTTGAATCCCGATGAATATTTTGAGTTCAAAGGTTCCAATATATCATCACTAGAAGGGTACCCTGCATAG

Protein sequence:

>DPOGS207351-PA
MSMLAGPRRKQKIINLRAKNSAWSNDTNKFGQRMLEKMGWSAGKGLGANENGIVEHIVARYKNDEKGLGYEDKNDQWTKHEDDFNSLLANLSNNSENNSENLHSGVSLEDKSKKSKARIHYHKFTRGKDLSRYSEKDLANIFGKKTFKQESVKEPVKDDISNIEQKFTEKGNMDDYFKSKLASLKNKPKVLCNDNIDREEINYGFQGFSTETNDQNNECDIPKVGFQPFSFYGTQSNNDNKQDTADISEGNEEVIKSKKKKKSKQKHPYDDSENCSESQEVNGMSEQNSDEQILPAKKKKSKKDKTKEIESECFESIKIKKKKRKNEQEYVNEDHQKKGTMESKKYVVVKNHVREIVDNENEDVTEETTSTDKQKTSKKSKKAKKDQTSTGQDESVSDMKEKDLQQLLSAYNSCKAIITKIETKYGHLLNLNSNEEGTEYDTSRKRKYRYNQCYCSPNKRYRYDEGGDSQYYDNRYSRQWGYKQYNNYDTSYNRQSNYDSPYNYYNRKSHVQVEDESSYELPNSLCELGDLLKDPSIHRSYRYRIIQQMKYMRREYSSIMRYDKRYMIQQLKLNPDEYFEFKGSNISSLEGYPA-