Monarch geneset OGS2.0

DPOGS207762
Transcript: DPOGS207762-TA, 3138 bp
Protein: DPOGS207762-PA, 1045 aa
Genomic position: DPSCF300042 - 333516-340453
RNAseq coverage: 178x (Rank: top 50%)
Annotation
Heliconius      HMEL017550        0.0     59.69%
Bombyx          BGIBMGA005314-TA  2e-111  67.48%
Drosophila      CG7837-PA         1e-16   32.92%
EBI UniRef50    UniRef50_D6WIH0   2e-40   27.61%  Putative uncharacterized protein n=1 Tax=Tribolium castaneum RepID=D6WIH0_TRICA
NCBI RefSeq     XP_392779.2       1e-31   25.32%  PREDICTED: similar to CG7837-PA [Apis mellifera]
NCBI nr blastp  gi|270003223      9e-40   27.61%  hypothetical protein TcasGA2_TC002427 [Tribolium castaneum]
NCBI nr blastx  gi|270003223      4e-41   25.61%  hypothetical protein TcasGA2_TC002427 [Tribolium castaneum]
Group
Gene Ontology   GO:0005488  6.4e-25  binding
                GO:0005515  1.3e-07  protein binding
KEGG pathway 
InterPro domain  [1-371]    IPR016024  6.4e-25  Armadillo-type fold
                 [323-362]  IPR011989  2.4e-21  Armadillo-like helical
                 [869-974]  IPR011333  2.8e-09  BTB/POZ fold
                 [869-974]  IPR013069  1.3e-07  BTB/POZ
Orthology group: MCL25126, Lepidoptera specific

Nucleotide sequence:

>DPOGS207762-TA
ATGGATAAAACACAAGTTAAGGCCATGTTAGATGGACTTAAATCTTCAACTTCAAAAATAATCCAAGAGTCTTTATTAAAAATTAAGTCAATGATTGTTAATTCTGAGAAAGGAGCTAAACTCTTCAGGGAATGTAATGGTTTTCCTTACCTGGTACCACATCTGCTGAAACCAAATGAAAATATTCTGAATCTAACATTAAGTATCCTGGGGGACCTGTGTCTAGATCAGAAAAACTGTATGGCTATTGGAAAATTGAATACGTATGGACCTTTGGTAACGATATTAAATACGGTATGTCGCGATAGCATTCTAGGGAGGACATCCCGTTTAATTGGTAATTTGGCTCGTGACAGGAGTAATGCTGAAAAATTTTTTAATCACGGCACGGTAAAAGCGCTGATGGCCATTATTGATAATAGGGATAAAAAAACCTCGTACGCAACCCTCATTATGGTCGTAAGAGCTATTCGGAAGTTGTGGTCGGTGGAAGAGAAGAGAAATGAAATGATTAGTATGAACGCGATCCGTTGCGTTGCTGTATTGATGACATCTGAATGCGAGATCATGGGCTACATTAAATCCTCTGACAGCGACAGTGACGTCGAAGAACCTAGTAGGCTCCAAGAGGACTTTATGGGTGGCATCCTAAAATGCATATGGAGTTTCACTTCGCACCCCGTCGCGTCTTGTGCTGAACAGATCCAAGGCGACGGCCGCGGCTATCAGTGCCTGGTGGTGTTAACAAAAACAAACATGACGATAGCCATGAAGTGTTTGACGAACCTGTGCTTCATATCGTCCTGCCGGCCGCAGCTGGGTATGGCGGGATTCGTCGAATGTCTCATAGAGAATCTGAAGAAGGAGAAGGACGTGTCATATTGGCCGGACGGGTCACCTATGGCTTTGGCCCAACTGAGCGGGGAATCCGTGAACAGATCTCGTCTCCGCCGCTGTGGGCCAGACGGAGATGACCACTGGCGCGCTAAAACAAACACACATGCGATGAACGCACTGCTACAATACGTTTTTGATGATTCTTCGTTCCAAATACTTATCGGCGAAGGACTTGTTAGTATATTGACGGACAAATTAACTACGTACGTCCGTAACATGGGATATGAGCACAACGTGGAGACCAGTGCGAGCAACAAGCGTAAAGAGAAACCTGTGAACCAGGGTTCTGTGTACGATGTGGCTGTTCAGAACCTATCCAGGGATATGTACTACCGTCCCGCCTCTGGTTCTTCCAAACGAAAGAGTCAGCTGCTGTCAGATACAGGGGATGATATGAAGGTTGTGATAGAAAGGGACAACATGATCGTCGGTTTCGTCGACGCCATAGAGAGCGAGGCGAGCGAGAGCGAGAGCGAAAATGAGGAAGGTCCTCCGCCCAAAAAAAGAAATCTGAAACGATCCAAATCCAAGAGTCCGAAGAATTCTAAAAAGAAGTCAACAAACATAGCAAGTAAAGACTGGTCGTCGGGTGTGTATTGGGAGCCAAAAAGTCCAGAGTGGCCGCCGATGCTGCAATCTAATCCCTCTACGAGTCCGAAAAAGGAACCCCAATTACCAGACCTTAATGTTCAGTACACGGGTCCGGGTTCCGCAGAGAGGTTAAATTTCGAATGGAGTCCTGAATCTGGTGTCAGTATCGGAGAATTCTGTACACCGCCTTATCCACCGTGGAGTCCAACCAGACTGACGCCATCTCCTTCCAGCTTAAGCAATGAGGACGAAAGTTCAGACTCCGAAGCCTCGGGCAGCTATTCACCGGTTTGCAGTGACAACGATGAAACTAACGAAGGCACTACCACCAAGACACAGGATGTCGAGGTCATCGAAATAGACGAGGACAGCAACGAAGGGGAATGTGATATGGAAGACCTAAGTAAGCTGAACATACATTCAAAGGAAACAAACATAGCGAGTGTGATGGTACTATTGTTTCGCGTATCTCACGGTACTTGCACCACCTGTGGTAGCATCAGAGATGACTC
CGTTCCCAACCACACCATGGATTACCTCACCACCCGGGAATGTCTGTATGGTCTAATGGAATACGTTGAGAAATGTAAACGCCCCATGGGCAGGGCGGCGAGGATACTAGCCAGGGTGTTGAGTAGCGACCTGTGTCTAATGAGTGTGATGAGGCACAGACTAGCGTTACGACTGCATCGCATGTCCACTACTTCTAAACACCCAGCCGCAGAATGCGTTCAATGCAAACAGATTATGAAACTCTGCAAGAAATTAATGAATCAAATGGGTTCATTGGCTGAATCCAGCTATGGTATTGGAAAAATTAGTTATCATCTACTTAAAGGCAGTCCGTCAATGAAACATACACTGGCATTGACCTTGCCGTATATTGTCAAGACTGAAAAAGCATTAAAGAAATATTTAGTAGATTGCAACGGTCTGAACATATTAATAAATCTCATAGACGATGGCAAAGAAGAATTACAAGAATGTGTCACAGCACTTTCAAAACTCGCACACAACGTTGATGTGAAGGATCCTAAACTCTTAGAGAACAGGTACAAAGAAACCGTCCTAATGATCTACGAACCGACCTTTGACAGTTTGTCCCCGGACAGTATCGTGACATTCAAACTGGACGACTCGTCCACTGTTAGAGCGAACAAAGACTTTTTATGCCAGCATTCGGAATATTTTAACGCCATGTTGATGGGACGCTTCAAGGAATCCGCTGAGAACTGTGTCCGTCTGAAGAATGTCACCAAGAGCGGTTTGGAATACCTCCTGACCTTGTTAGACTGCGGCCTCTACGACGCCCATTCCGACTTACAAATCTTTCCAATGGCGCCAAGTTTGAAGACGAATCTGGAGGTTCTGTTATTAGCGGACAGGTTTTTGTTCGAGAAATTAAAAGAATTATTAAGCAGTGCTATATTACAGTTCAAACTGGGCCCGAACACCGCTGACAGAATATATACTTGGTCGTTGAGCGATGGAATGGGTTTCCTCTGTGTGGAGGCAGTAGCGTATATACTTACAGGGAAGATGTCCGACGAGAACAGATATCAATCGTTTAGTAAAATACTTAACCTCCAATACAGGGATCAGTTTCTTGAAGATATTAAGGCTATGCTTTTAAGGCAAATGGCAAAATAA

Protein sequence:

>DPOGS207762-PA
MDKTQVKAMLDGLKSSTSKIIQESLLKIKSMIVNSEKGAKLFRECNGFPYLVPHLLKPNENILNLTLSILGDLCLDQKNCMAIGKLNTYGPLVTILNTVCRDSILGRTSRLIGNLARDRSNAEKFFNHGTVKALMAIIDNRDKKTSYATLIMVVRAIRKLWSVEEKRNEMISMNAIRCVAVLMTSECEIMGYIKSSDSDSDVEEPSRLQEDFMGGILKCIWSFTSHPVASCAEQIQGDGRGYQCLVVLTKTNMTIAMKCLTNLCFISSCRPQLGMAGFVECLIENLKKEKDVSYWPDGSPMALAQLSGESVNRSRLRRCGPDGDDHWRAKTNTHAMNALLQYVFDDSSFQILIGEGLVSILTDKLTTYVRNMGYEHNVETSASNKRKEKPVNQGSVYDVAVQNLSRDMYYRPASGSSKRKSQLLSDTGDDMKVVIERDNMIVGFVDAIESEASESESENEEGPPPKKRNLKRSKSKSPKNSKKKSTNIASKDWSSGVYWEPKSPEWPPMLQSNPSTSPKKEPQLPDLNVQYTGPGSAERLNFEWSPESGVSIGEFCTPPYPPWSPTRLTPSPSSLSNEDESSDSEASGSYSPVCSDNDETNEGTTTKTQDVEVIEIDEDSNEGECDMEDLSKLNIHSKETNIASVMVLLFRVSHGTCTTCGSIRDDSVPNHTMDYLTTRECLYGLMEYVEKCKRPMGRAARILARVLSSDLCLMSVMRHRLALRLHRMSTTSKHPAAECVQCKQIMKLCKKLMNQMGSLAESSYGIGKISYHLLKGSPSMKHTLALTLPYIVKTEKALKKYLVDCNGLNILINLIDDGKEELQECVTALSKLAHNVDVKDPKLLENRYKETVLMIYEPTFDSLSPDSIVTFKLDDSSTVRANKDFLCQHSEYFNAMLMGRFKESAENCVRLKNVTKSGLEYLLTLLDCGLYDAHSDLQIFPMAPSLKTNLEVLLLADRFLFEKLKELLSSAILQFKLGPNTADRIYTWSLSDGMGFLCVEAVAYILTGKMSDENRYQSFSKILNLQYRDQFLEDIKAMLLRQMAK-
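
The lengths reported for this record are consistent with a complete coding sequence: 1045 codons plus one stop codon give 3 × 1045 + 3 = 3138 bp, and the trailing "-" in the protein sequence appears to mark that stop. A minimal Python sketch of the check follows; the function name `cds_length_matches` and its `has_stop` parameter are hypothetical and not part of the geneset or any library.

```python
def cds_length_matches(transcript_bp: int, protein_aa: int, has_stop: bool = True) -> bool:
    """Return True if the transcript length equals three bases per residue,
    plus three bases for a stop codon when has_stop is True."""
    expected_bp = 3 * protein_aa + (3 if has_stop else 0)
    return transcript_bp == expected_bp


# Lengths taken from the record: transcript 3138 bp, protein 1045 aa.
print(cds_length_matches(3138, 1045))
```

The check assumes the transcript contains only coding sequence, with no untranslated regions. A transcript with UTRs would be longer than this formula predicts.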