Monarch geneset OGS2.0

DPOGS204167
TranscriptDPOGS204167-TA2655 bp
ProteinDPOGS204167-PA884 aa
Genomic positionDPSCF300034 - 260165-262977
RNAseq coverage112x (Rank: top 59%)
Annotation
HeliconiusHMEL0139650.085.87% 
BombyxBGIBMGA005083-TA0.075.18% 
Drosophilapros-PI2e-3938.86% 
EBI UniRef50UniRef50_D6WUC42e-13443.16%Prospero n=3 Tax=Coelomata RepID=D6WUC4_TRICA
NCBI RefSeqXP_971664.25e-13042.16%PREDICTED: similar to homeobox protein prospero/prox-1 [Tribolium castaneum]
NCBI nr blastpgi|2700111078e-13443.16%prospero [Tribolium castaneum]
NCBI nr blastxgi|2700111079e-15544.33%prospero [Tribolium castaneum]
Group
Gene OntologyGO:00056343.5e-128nucleus
GO:00036773.5e-128DNA binding
GO:00072753.5e-128multicellular organismal development
GO:00063553.5e-128regulation of transcription, DNA-dependent
KEGG pathway 
InterPro domain[98-829] IPR0077383.5e-128Prospero homeobox protein 1
Orthology groupMCL14987 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS204167-TA
ATGTCATCGGAGGAGGAGGCCGACTCACACGCGCCCTACTCGGATAAACTACTGAAGAAACAGAAACGCGTCAGACAGCGCGTGGACGCCGGCGAGCCGCGCAATTCTTATTCTACTCTCGCGACGAACGGACCCGCGCCCGGGCGCGTCGCACACATGAGTGGAGGACTTTACGGGGCCATCTTCGAGGGGCGCCAGCACTTCGGTCTCTTCGGCCCTTGCTACGCTCCGGCTGAAATGTTAAACGAATTATTAGGCCGTGCACCCAGACAAGAGGATGCCGGCGACGAAGCTTCCAGCGGCGATATGCTTCGCGATCGTGTCCTTCGCGACATTCTACAAAGTCGGAAGAAAGAATTGATGCGATTATCGCCTGACAATAACAACGTGCTCGCTAATAATAATAATGATGAACCCAAAGAAGAGAAACGCTCTCCGGAGCGGCCTATCTCCCGCGACTCTCGGCATGATCTCGACGATGCTCTATTGAATCTCGACAGTGCCGGTGACTCGCGGACCTCGCCGCCTCCCGTTTCTCAAAACGAACCACTTCTTGCTCCAAAGTCTGAACCGGAAGACCGCGATAGTGACGCCCAAAGATCTTCCCCGAGACCTCTTGATGTGAAACGTGCTCGTGTAGAAAATATAGTGTCAACGATGCGTGCTAGTCCCGCACCCCAACAGCCACAAGTGAACGGTTGTAAAAAAAGAAAATTATATCACCCGCAACAACACGACGGTGCAGCAGAGCGTTATGGAACAAGCCATGGCAGCAGTGCACAGACAGTGTCGGATGAGTCTGAGGATGACGGTGATCAACCACCGATTCAACAAAAGTTAGTAGAAAAAAATGCATTAAAAACACAACTGAGAACTATGCAGGAACAACTTGCGGAGATGCGAGAGAAATATATACAGCTATGTAACAGAATGGGTCAAGAATCAGAAACGGCTGATAATGATGGTGCTTCTAGTGATATTGAACAAAACGAGGATCCCGTGCCCAAATCGGAACCATCGTCTCCAGTTAAAGAGGTACCTCCTTCAATACCTAATAGTGCAGCGCCGAATATGTTCAACCAAGTCATAAATAATATGATGTCAGGAAAACTGCCGTCACACCCTGCAGCTCATCCTCACTTGCCACCTGGTTTCAATGGAGCACTACCTCTTATGCCCCATATGCAACCTGGTGACCACATGCATCCACCACATACTCATCAACATCTTAATAATGCTGCTGCTATGTACCTTAACGTCAGCCAAAAGCTGTTTTTAGAACAAGAGGCACGAATGAAGGAAGCTAGTGAACAGCAACAAATCAGCCAACAACGAAGGCCACAATCTAATCAGCACTCACCACAACAGAGACAAATGGGTCCAACTCCTAAACCTCCAGCCTCTGAACTGGCCGAGCGTCTTGATGCATTACGAAGCAATTCAGGGTCCGTTGGACCTGTATCTGGTGCTGACCTAGAAGGCTTAGCCGAGGTTCTGAAAAGCGAAATAACAGCTTCTCTAGCAAGCCTCATTGACTCGATAGTAACCCGTTTCGTACATCAACGTCGTATAATGGGGAAGCAATCAGAGGCAGCAGCGGCAGCGGCGAAACAATTGAACAAGGACTTAATTCAAGCTGCTCGTCTCATCGAGAAATCTCCAACGCCAAAGATGCCGGAACGGCCGTCAGGACTCATGCCAGGTAATCCACCCGTCCATCATCCGGGCGCTCCTAATGGCGTACCGCTAATGCCCAATAACCCGATGTTTATGAGTCATATGAACGGCCCCCGTGCGCCGGGCGGAGCAGTGTTTCCTCTGCACGCCGAGGCGGGCGGGCCAGGCCACGGAGCTCACATGCGGCCTCCAACAGGCATGTTCCAGGCACCGCAGAAGTCACTACAGTCGCACTTCGGTTCATTAAATGGACACTTCGATCGAGACCAGAATTCAGACCCGAGTGAGCCCCTGAGCCTTGTGATGACCCCTAAGAAAAAGCGACATAAGGTGACCGACACTCGCATCACTCCTCGCACAGTGAGCAGGATTCTAGGAGAAGGTGTCGTCCAGTCACCGGAGATAAAATTCCCGGAATCGCCGTCACCGCGGCCGTTCCACGGAGGAATGGCGCTGCCGACCTCAGTGGCGATACCGAACCCATCACTACACGAGAGTCAAGTCTTCTCTCCGTATTCACCATTCTTCGGTCCAGGCGGAGGATCGGTTGGATTGGCTCGATCTCCACCGGGACCCGAGCGGGACTCGCCACCGCTGCCAAACTCGATGCTGCACCCCGTCTTGTTAGCGGCAGCTCACCACGGCTCGCCGGACTACATGCGACACCAGCACGTGCCGCATCACGCCGCCGGCCCGCATCACGCGCAGCACATGGACGCGCAGGACCCTCACTCCGACTGCAACTCCACTGAGATGCCTTACGACGGAGTTCAGCCTACTATATCCTTTTCAAATATAACTTATAAATTTATTTCAGCTCAATTTAATATGTATGACTGCGCCACCTACTTCCTACCAGTAGTTGATATTATACGACTGTTGAAACCACACAAGTGTTGCGGTTTCTACGCTTTACCGTACTTGCCTATCGCCTACCTATACATGAAGGAAATAACCGTAAACTTAATTAAACGCTGA

Protein sequence:

>DPOGS204167-PA
MSSEEEADSHAPYSDKLLKKQKRVRQRVDAGEPRNSYSTLATNGPAPGRVAHMSGGLYGAIFEGRQHFGLFGPCYAPAEMLNELLGRAPRQEDAGDEASSGDMLRDRVLRDILQSRKKELMRLSPDNNNVLANNNNDEPKEEKRSPERPISRDSRHDLDDALLNLDSAGDSRTSPPPVSQNEPLLAPKSEPEDRDSDAQRSSPRPLDVKRARVENIVSTMRASPAPQQPQVNGCKKRKLYHPQQHDGAAERYGTSHGSSAQTVSDESEDDGDQPPIQQKLVEKNALKTQLRTMQEQLAEMREKYIQLCNRMGQESETADNDGASSDIEQNEDPVPKSEPSSPVKEVPPSIPNSAAPNMFNQVINNMMSGKLPSHPAAHPHLPPGFNGALPLMPHMQPGDHMHPPHTHQHLNNAAAMYLNVSQKLFLEQEARMKEASEQQQISQQRRPQSNQHSPQQRQMGPTPKPPASELAERLDALRSNSGSVGPVSGADLEGLAEVLKSEITASLASLIDSIVTRFVHQRRIMGKQSEAAAAAAKQLNKDLIQAARLIEKSPTPKMPERPSGLMPGNPPVHHPGAPNGVPLMPNNPMFMSHMNGPRAPGGAVFPLHAEAGGPGHGAHMRPPTGMFQAPQKSLQSHFGSLNGHFDRDQNSDPSEPLSLVMTPKKKRHKVTDTRITPRTVSRILGEGVVQSPEIKFPESPSPRPFHGGMALPTSVAIPNPSLHESQVFSPYSPFFGPGGGSVGLARSPPGPERDSPPLPNSMLHPVLLAAAHHGSPDYMRHQHVPHHAAGPHHAQHMDAQDPHSDCNSTEMPYDGVQPTISFSNITYKFISAQFNMYDCATYFLPVVDIIRLLKPHKCCGFYALPYLPIAYLYMKEITVNLIKR-