Monarch geneset OGS2.0

DPOGS209287
TranscriptDPOGS209287-TA1740 bp
ProteinDPOGS209287-PA579 aa
Genomic positionDPSCF300359 - 33156-40291
RNAseq coverage12x (Rank: top 83%)
Annotation
HeliconiusHMEL0031083e-11585.89% 
BombyxBGIBMGA013723-TA5e-11274.91% 
DrosophilaCG4050-PA9e-4331.07% 
EBI UniRef50UniRef50_UPI00021A88EF5e-12046.41%UPI00021A88EF related cluster n=6 Tax=unknown RepID=UPI00021A88EF
NCBI RefSeqXP_002426792.13e-11049.02%smile protein, putative [Pediculus humanus corporis]
NCBI nr blastpgi|2700015513e-12648.53%hypothetical protein TcasGA2_TC000396 [Tribolium castaneum]
NCBI nr blastxgi|2700015512e-12848.53%hypothetical protein TcasGA2_TC000396 [Tribolium castaneum]
Group
Gene OntologyGO:00054886.9e-27binding
GO:00055157e-05protein binding
KEGG pathway 
InterPro domain[63-137] IPR0136183e-28Domain of unknown function DUF1736
[225-408] IPR0119906.9e-27Tetratricopeptide-like helical
Orthology groupMCL15715 Patchy
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS209287-TA
ATGCTGGCCAAAGAGACAGGACTGACTGCACTGCTGTTTAATTTAGTTTTTGATATTTATCGAAGTTGGAGCTCAGTCACAAGGTCGCCATCGAAATTCCGTTGGCGGCACGACGGTCTGTGGAGCAGACTGAGCAAAGGTCTCATAGTCCTGATCATGTTAGCCATTGCTAGGCTCGCCTTGCTTCAAGGATCCTTACCAGCTTTCTCGACGCAAGACAACCCACCGGCGTTTCATCCATCCTTCATTGTCAGATTAATGACGTTCTGCTATTTAGCAGCCTTCAATTGGTGGCTGCTCCTGTGTCCTTGGACTCTTAGTCACGACTGGCAGATGGGGTCGATACCTCTGATTACCAGCGGTTGGGATCCAAGGAATCTTATTACTGGGGCAGCCTTAGTAGCCCTAATGGCGCTGTCATATAGATTCTTGATGGACTTGGAGCTTCAGAGACATACTCCTCTGGTAGTCGGATTGATGCTCCTTGTAATCCCATATCTACCAGCATCCAACCTACTGGTCACTGTAGGATTTGTTGTAGCTGAACGCGTTCTGTATATACCAAGTGTAGGGTCAGTTCTTTTGACGGTATACGGTGTGCAAATATTGTGTCATCGAGGTGGAAGGTGGCTGGCACTAGGACTGGCAATACTTGTAGCTGCTGGCGCTGCGAGGACCTTCGATAGGAATAGAGACTGGAGGACGAGAGAGACACTGTTGAGGGCGGATCTAGCGGTTTTACCGCACAATGCAAAATTGCACTATAATTTTGCAAACTTCCTTAAAGACATCGAGCAACAGGAGAACGCTATAAAACATTATAAAGAAGCGCTTAAACTGTGGCCAAGTTACGCGAGTGCTCACAATAATTTAGGCACGTTGGTCCTGGCATCAGGACGAGCGGAGCATCATTTTCTACAAGCTCTTAAATACAATAAGGACCACGTGAATGCTCATTACAACCTGGCCAAATTATACCGGAAAAAGAATCGCGTGTCTGAATCATTAAAGATGTTGGAACGCTGCATAACTCTGGAGCCGCGGTTCGTCCTGGCCTACATCGAACTTCTTCACATCACCCCCGAAAACGAGAAGAGACCCACACTAGAGAGATTGATTGAATTGGAACCGATGAATTGGGAACATTACTACCTGTATGGAAACTGGCTGAAGGGAAGAGGTCTTTGGCCTCTGTCTCTCTCGTACTATAAACGGTCTATCCGTGTGTCGTTGTTGTCTCGTGGAGCACTACCGCCGCTGAGGGCAGCATGCCTTCTGTTGAGGTCTGCTGGTCAACGAGTACGGCTTCTGCAGCTAACCACGAGCTGGCACACCCTTGTGGGAGGTGAGGGGGCAGCCTCACGACGTCGGGCTGCGGCCGCGGCCTGGAGACTCCGCAGTGAATTGGAAGGCCGAGCTGCTAGATACGCTAGGACTCCTGTAATGACGTCCACTTGTTTGCATCACAGTCAACTCGAGGTGTCGCCGGAGACCATTAAACAAAACAGTTTAGTGAAACACACGAGGACAATAACTGTATCCAGTGATTTAAATATAAACACAAGTTGCCAAAAATTAGATAATAAAAAAAGCGTATCTATATCTGCTCAATATAAACCTATTCACAATAAATCTGAAGCCGGCCCTAAACAGAAGAGTTGCCCTCTACATAAGAAAAACAAAATCAAAGAGCCGGATCCCCTGCCGTTTGTTAGCGATCATTTAATAAAAACATCCTAA

Protein sequence:

>DPOGS209287-PA
MLAKETGLTALLFNLVFDIYRSWSSVTRSPSKFRWRHDGLWSRLSKGLIVLIMLAIARLALLQGSLPAFSTQDNPPAFHPSFIVRLMTFCYLAAFNWWLLLCPWTLSHDWQMGSIPLITSGWDPRNLITGAALVALMALSYRFLMDLELQRHTPLVVGLMLLVIPYLPASNLLVTVGFVVAERVLYIPSVGSVLLTVYGVQILCHRGGRWLALGLAILVAAGAARTFDRNRDWRTRETLLRADLAVLPHNAKLHYNFANFLKDIEQQENAIKHYKEALKLWPSYASAHNNLGTLVLASGRAEHHFLQALKYNKDHVNAHYNLAKLYRKKNRVSESLKMLERCITLEPRFVLAYIELLHITPENEKRPTLERLIELEPMNWEHYYLYGNWLKGRGLWPLSLSYYKRSIRVSLLSRGALPPLRAACLLLRSAGQRVRLLQLTTSWHTLVGGEGAASRRRAAAAAWRLRSELEGRAARYARTPVMTSTCLHHSQLEVSPETIKQNSLVKHTRTITVSSDLNINTSCQKLDNKKSVSISAQYKPIHNKSEAGPKQKSCPLHKKNKIKEPDPLPFVSDHLIKTS-