Monarch geneset OGS2.0

DPOGS202047
TranscriptDPOGS202047-TA3654 bp
ProteinDPOGS202047-PA1217 aa
Genomic positionDPSCF300053 + 113451-120960
RNAseq coverage715x (Rank: top 18%)
Annotation
HeliconiusHMEL0099330.046.48% 
BombyxBGIBMGA001281-TA7e-12549.08% 
Drosophilarig-PB8e-1721.51% 
EBI UniRef50UniRef50_D1ZZV02e-7923.51%Putative uncharacterized protein GLEAN_08111 n=1 Tax=Tribolium castaneum RepID=D1ZZV0_TRICA
NCBI RefSeqXP_001815774.14e-8023.51%PREDICTED: similar to gemin 5 [Tribolium castaneum]
NCBI nr blastpgi|1892364919e-7923.51%PREDICTED: similar to gemin 5 [Tribolium castaneum]
NCBI nr blastxgi|1892364918e-8223.55%PREDICTED: similar to gemin 5 [Tribolium castaneum]
Group
Gene OntologyGO:00055151.4e-37protein binding
KEGG pathway 
InterPro domain[508-854] IPR0110461.4e-37WD40 repeat-like-containing domain
[464-742] IPR0159433.4e-22WD40/YVTN repeat-like-containing domain
[765-805] IPR0016806.5e-08WD40 repeat
[769-805] IPR0197812.2e-07WD40 repeat, subgroup
Orthology groupMCL16661 Patchy
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS202047-TA
ATGGACGAGACTGTAATATTTCCGTCGCCAAATTGGTTCCAAGCATCTGTGATAGCGATTTCTCATGATGGATGGCTTATTTATGGCGGGCCCATTAAAAGCCTTTGTATCTTAGAACCGTTACATTCTGAACACGATGGAGTTTTCAAACACAATCAATCTTATAGAGCTCATGTGATGAATAAAGCACATTTAGAAAAGATCACAAGTGTGGACATTTCAAAGGAATGGCCAGAAAAGAAATTAGTTTTAACTGGAAGTGCTGATGGTTGTGTAAAGCAGTGGAACTTAGAACATTTTAAGAACTCTATAAGACTTAAATCTACCCTCAGTCATGAAATTCATTATAATGATAAAGAGGATGTAGCTGGTCTAGGTTATAGTACGGATGTGTTTGCCATCACAGTGGGAGGCTATGGTAATATCGTGAAATGGGATTTGAAATCCAATGTTGTTAAGACTTATAATCAATTTTTAAAAAGCTTCAAGCCAACTTGTGTTGCATGTTCTCAGCACACACCATTAAATGTTGCCGTGGGAACAAAACAGGGAGTTGTATTTGTTTTAGATTTAAATGGGAATGGCAAAATTGTCTATAAGGTGAGGGGTCAGGATGATGAGATAATCAACTTGTCATGGTGTCCACAATATGAGGTGATTCTTAAAAAGACCCTCAAAGAATCACAAAATCGTACACATCTAGAAAGTAAATTAGATAAATTAAAATTAAAGGATGCTGAAGAAGATTTAAATGATTCAGGAATATCAAAGAACCTTCCAGAAGACAGCTTTGATGAATCCATTGCACAGGAAGATGATATGTTTGATATATACAAAGACCATGAAGCAGATGAATTTGGTCATAAAAAGTTTCAGCCGACCGACATCATAGTGAAGTTGAAAAAAGAAACACCTTCCGGTGACTTCTTGGCTGAGTGTTTAAAACTAAAGGAGGCAATAATAAATAAAAAAAATGATAAGGAATCTTCAATAGCAACTCTAGTTGATGCATTGGATAAGACGCATGTAGACAGTAATGATTGTGGTGAAAATAAAACAGATGTTAATGAAATACAAACAGACGGGGATGACTTGCATAAAACAGATAATGCAGGAGAGGTTACAGAAGAAACACAATCTAAAAATATGGAAGAATGTAGTTCTCATTTGCACAAACATCTCTTGGCTACTATCGGAAAATACGGGGGCGTAAGGCTGTGGTCCAAATCGGGGAAACTTGTAGGTTCTTGTGTTGTACCAAACGCCGTGAACAAGAATCATAGGAGTAAAGGTCCCATAGCAACAACATTATTATGGTACAAGCCTGATGTATTGCTCATCGCGGATGGAAAAAGTCAATTGCTTGAGTGCAATCCAATGAAGATAGACTGTAGGAACAAACTAGATTGGCAGATAGTCCACTCGTTGCACAAACGTGGTTTGTACGCGATCGCAACTAACGCGCCTCGTGTCCAAACAGAAAATTCGAATGGTTCAGATGATTGGCTAGTTTGGACGATTGCGCAAGATCGTAACATTGTCTGTTATTCTATGGAAAGAAAGGAAAAGATATCTGTGCACAACACCTGTGGAGGCTTTGTATACTCCATACAGCACTGTCCTTATGATGCTAAGAAGATAGCAGTAAGTGTAGGTGATGGGGCTGTACGCATTTGGAACACAGATACGCTTGTAGAAGATGACAGCAAATTGTCTATGGGTCATGTGACTTCATACTGGCAGAATGTTCAAGGCAAGGTGTTGACGGTTGCATGGCACCCCACAAAAGAGAATTTACTTGCCTTTGGTACTGCTGAAGCCAGGGTGGGTTTGATTGATACAAGTGGTAAGACAGAGCGGCCGGCCAGAACATTACTTCCAGTACTACAAGGTGGAGTGTATTCTTTGTGTTGGGGACGGAATGATCAACTTTATGCCTGCGGTGGAGGGAAACTGGTCGTCTATAACACAGATGCTATCGATAAAGATCCAATGCCAATAAAAGTCCAATTTGAAGGAAAGCAATGGGAATTAAGTTCAGTGTTGTTTCATAGCCGAGGTCTAGTGTGTGGTGGAGTTAATGGGGCTTTGGCTGTATTAGATCCTGACACCAATGAAATTTTAACTGCGTCTTTTATATTTGGCAAAATGATATACACTACAGAATGGCACCCTCAGCAGACATCTACATCCAGCGAGGATTCTATATATAGAGACTTAATAGCTGTCTCGTCTCTTGATAAAGCGTGCAGTATTATAATCGTGGAGTATAATGACAAAGGAGACGGTCCTAAAATACACCCGTTTAAAACTTTGTCTGGTCACACGGCGACCGTGCTACAGCTGTCATGGAATCCACACAACGATGTCCAGCTCTTGTCGACCTCACATGACACTACAGTTCGAATCTGGGATATCTCATCTGGTGAATGTACTCATATTTTCGGAGGTCATTGCCACGCCTCGCTCAGTGCATGTTGGAGCTCATTTCCATCACTGTCCAATGTAGTAATGTCAAGCGGGTCCGACTGTTGTTTGAGATTGTGGCAAGTGGACAGACATACAACAGATGTTTATACTGACATGTTTCGCAAAATGGCTCCGGGAGGCGCGAAAAAAACTAAGACTAAAAAGGCTGAAATTCAAGAATTGGAAAAAGGTGAAGAACAAGTCGCTACCACCTTCGATACGAAAGCGTCCACCAAAGCACCAAAGAAATTCCTCTTACCTATAATTAGTAAACAAATATCGCCATGCACTGTGTACAGTGTGAGACAAATGCTAGTCAAATATTGGAGCGATCGAAATGGAACAAACGAGAAGGTGGCTAACGGTCAACCTGATGTCGTGGAAGAAAACGTCGAAGGGAAGGAGGCAGAGGAAAAAATCGTGGAATTCACAAAGATTTTCGGTACAACGAATGATCTGAACGAAGTCTTGGACATGGAAATGGCTCGTCACTCCACATGTAACCGTTGGGAGTCGTGTGTTGTGCTGAACGTGCTCCGCGGTCAGATGTCCGATATGGTGACGTCAGCGGCCGCCCGCGGGGAGCTCTGCCCGTTCATTGTGAGCCTCGCCCCCACCGTCTCTCACAAATTTTGGAAAGATGCAACGCAAATGTATTTGGCTCAAATCGATCGAATGATTGCTAAAGGAGAGGAAGAGAAGCTTAGCGAGAACAAACAGTACGGTGGCGCCATCTACCGTAAGGCGTGTCTTCAGCTGTGTTCACACGACGTACGAGCCGCCGTACATACACTCGTAGACGCGAGACTGTTCAAGGAGGCTTACATATTGGGGAGGGTCAGGCATATGGACAGCATAGCGGAGGACACGTTAAAGAAATGGGCAACTGATTGTTTACAAACTGGCAACATTTGTATGGCTGCGGTGTGTTATTTAGCCTTAGGCGATCCGTACCAAGCCGCTCTGGCCTTGTCAAAATCGGACGATCAAGAACTACTCGGCATAGCGTCGGAACTAGCGAAGGAATCTGGACAGGCGACATTCGCTAATCATATAGAAGATAAGAAAACGCAAATATTAAGCGAAACGTCGGAAAATGATGAACAACTAAAGAAACTTCCTACAAAAATCGATCTATTGATTGATAGTGTTGGCACTAGTGAAGTTACATCGGATGTGATATGA

Protein sequence:

>DPOGS202047-PA
MDETVIFPSPNWFQASVIAISHDGWLIYGGPIKSLCILEPLHSEHDGVFKHNQSYRAHVMNKAHLEKITSVDISKEWPEKKLVLTGSADGCVKQWNLEHFKNSIRLKSTLSHEIHYNDKEDVAGLGYSTDVFAITVGGYGNIVKWDLKSNVVKTYNQFLKSFKPTCVACSQHTPLNVAVGTKQGVVFVLDLNGNGKIVYKVRGQDDEIINLSWCPQYEVILKKTLKESQNRTHLESKLDKLKLKDAEEDLNDSGISKNLPEDSFDESIAQEDDMFDIYKDHEADEFGHKKFQPTDIIVKLKKETPSGDFLAECLKLKEAIINKKNDKESSIATLVDALDKTHVDSNDCGENKTDVNEIQTDGDDLHKTDNAGEVTEETQSKNMEECSSHLHKHLLATIGKYGGVRLWSKSGKLVGSCVVPNAVNKNHRSKGPIATTLLWYKPDVLLIADGKSQLLECNPMKIDCRNKLDWQIVHSLHKRGLYAIATNAPRVQTENSNGSDDWLVWTIAQDRNIVCYSMERKEKISVHNTCGGFVYSIQHCPYDAKKIAVSVGDGAVRIWNTDTLVEDDSKLSMGHVTSYWQNVQGKVLTVAWHPTKENLLAFGTAEARVGLIDTSGKTERPARTLLPVLQGGVYSLCWGRNDQLYACGGGKLVVYNTDAIDKDPMPIKVQFEGKQWELSSVLFHSRGLVCGGVNGALAVLDPDTNEILTASFIFGKMIYTTEWHPQQTSTSSEDSIYRDLIAVSSLDKACSIIIVEYNDKGDGPKIHPFKTLSGHTATVLQLSWNPHNDVQLLSTSHDTTVRIWDISSGECTHIFGGHCHASLSACWSSFPSLSNVVMSSGSDCCLRLWQVDRHTTDVYTDMFRKMAPGGAKKTKTKKAEIQELEKGEEQVATTFDTKASTKAPKKFLLPIISKQISPCTVYSVRQMLVKYWSDRNGTNEKVANGQPDVVEENVEGKEAEEKIVEFTKIFGTTNDLNEVLDMEMARHSTCNRWESCVVLNVLRGQMSDMVTSAAARGELCPFIVSLAPTVSHKFWKDATQMYLAQIDRMIAKGEEEKLSENKQYGGAIYRKACLQLCSHDVRAAVHTLVDARLFKEAYILGRVRHMDSIAEDTLKKWATDCLQTGNICMAAVCYLALGDPYQAALALSKSDDQELLGIASELAKESGQATFANHIEDKKTQILSETSENDEQLKKLPTKIDLLIDSVGTSEVTSDVI-