Monarch geneset OGS2.0

DPOGS211841
TranscriptDPOGS211841-TA4887 bp
ProteinDPOGS211841-PA1628 aa
Genomic positionDPSCF300031 + 1048872-1057513
RNAseq coverage266x (Rank: top 40%)
Annotation
HeliconiusHMEL0164920.076.46% 
BombyxBGIBMGA006025-TA0.070.34% 
Drosophilac11.1-PA1e-15226.45% 
EBI UniRef50UniRef50_D6WKM30.028.53%Putative uncharacterized protein n=1 Tax=Tribolium castaneum RepID=D6WKM3_TRICA
NCBI RefSeqXP_001648365.11e-16827.54%hypothetical protein AaeL_AAEL004045 [Aedes aegypti]
NCBI nr blastpgi|2700065600.028.53%hypothetical protein TcasGA2_TC010431 [Tribolium castaneum]
NCBI nr blastxgi|2700065600.028.41%hypothetical protein TcasGA2_TC010431 [Tribolium castaneum]
Group
Gene OntologyGO:00054882.7e-21binding
KEGG pathway 
InterPro domain[752-1628] IPR0160242.7e-21Armadillo-type fold
[1313-1626] IPR0119892.9e-19Armadillo-like helical
Orthology groupMCL14616 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS211841-TA
ATGTCTTCTTCAATATCAGAATACTTAAAAGATACAGTGACTGTGCTCATAAACGCTATTGGAGATAATGATAATACAGTCAACAATGTTATTATTAAATCTCTGGTGAAGATATCAAATGTACATCCAAATGAAGTTATCGAGATATTTTGCGAATTCTACCGCAATACAGTTAAGAGTAATGTTATTCAGTTGGGAAACCTAGTAAAGGTATTGGAACAAACATGTGTGAATCAGCTCAAGAAAATCGATCAAAATGTAGCTAAAGAGCTCATTAATGCGATGCTCCGTGCCATGATGGAGAATCCTATGTATGAGCCGGTAGTTCAGATGGGGGCATCGTCTGTTCTAGTTGCTATCAGCCATGAGTACTTGGATCTGGTACTCCGATCTTTAATCGATCAGATATCACCCGCTTCCTTGCCGCATTACACGATCGCGCACACACTCGGCACGCTAGCGGCGGTCAACACGCACGAACTTGTACCACACGTCAAGGAGATCCTTGGCAAGATGCTGCCGTTACTGCCTCTCGTTAAACAGGATGGATCCAAGCAAGCCTTTGCTTATGCGTTCGGCCACTTCGCTGTAGCCGTGTCCGAACAAATAGGCGACAATGTTGCAAACGACAATATAACATCCATAAAGGACAATTTTGTCACGGAATTCACTATAGTCTTCGACGTGTTGTACAATCAATGGCTGCCATGCCACGAGCCCAAAGTATCAGAATCGGTACTCGAGGCCCTCGGCCCCATCACACGGCTCATATCAGAGAGACAGTTCAACGAAACGGTCAACAAGTTCGTCCTGTCACTGCTGTCCTTATATCGCAAACCAGCTATCAACTTCTACTGCATCAGCCAGTGCATATCATACTTATTGTCACCTTCGCCCTTAAATCCAAAACTGACGTTAAATGACAACGTCATAAATTCCATCAATAACGTGTTGTTTAACCTGGTGCTGTTGGAGCCCGATTACGATCAGCCGCACACCGTCAAGAATCACTTTGAAGTACTGCGCTGCTTCGACCACATGACGACGCAGTTCTCAGACCAAACCGTCGAAAGTCTGTTGCATCACTGTAAACATAACCAAGAGAAAGACAGAATGAAAGCTGTCATAATCCTGACCCACCTAACGACGTCCTCGCAGATATTTATCGAACATTTCTCGTCAAAGTTCATAACGATTTTGAAAGTGATGATCGTGATGGAACAGGGCGTGAAGATGAAGAAACTCCTCGTCAAGGCCATAGTGGGCCTCGTGTACAGGAACTGCATAATGTCGCCGGAGGATTTTGTGATGGTGGAGTTTATAATAAAGCACTGCGGATACGAGGGTACACCCACCACAACGAAGGCCGAAGTTCTGGATCTACAAGACACTTGTAAAAGTTCCCTCGTACTCATGTGTAATACTGTCACCAGCGTGCGCTCGCAGCTACGTAATTTGTTATTGAATTCGCTAACTGTCGACGAGTTCACCTCTTCCATGTCGACCGTCAGCCACTGTCTGACTTCCCTCCTTCAGAATAATTCAGACGTAATTCCCGACGAGCAGTCGGACAGAACAGAGAGGTTGAAATTAAACTGTTCACCAGATTTAGTTTTTGTCAGATGTTTGATTCAAATTGCGGACCCTGACCGAACCGAAACTAATAAAAACGTACTCGTCTTCTTGGACGAATTTTCCGGTGACGTTCATAAGAATTTGAAAAATAAGTGGACCATCGAAATACAGCGGCTGTTGAAGTTTGTGGAAAAAAACGAATCCAAAGAACAATGGCACGGGATGCTTTTGGATCTCCTAGTGTCAGCCATAGAAGAAGTTAACAGTAACAAATGGGTGGAAACGATATCGACATTGGTGTCGCAGCAGATTATGGGGAAAAAGCAATCGCCGATGTTTAAAGGAGTCTCACTTCAATATTTAGCGATACTTTCATGTCACTTGTCTAATGCGGCCGTCGTAGAAACTGTTCTAAAGATAATACTGCTCGCTTTGAAATCGATCCCCATGTCCAGTGTGGACTGTGTGAGCAAAGCGGTCGGCATCGCGTCGAGGGTTCACGGAGAATGTGTTCTCAACGAGCTTGACGCTATCTATAAAGAAAATGAAGCGAGACGCGGCAATAAACTCCTGAATTTCCTGTCAACCCGAGCCTCGAAGAACGAGCTCGAATTATCCATAGTTAAATATGCCGTGATAACTTGTTACGGTAAAGTAGCGAACGAGTGCTTGGATGTTCACGTCCTGGCCAGGTTAGGCGAAAATATAACTTCCATACTCTATGAAATATTGAAATCGAATCCGCCCTTCGATTTGTGCAAAGCCAGTGTGACGACTCTTTATGAAATAAGTAAGGCTTTGTATCCAGCGGCACATCACAATGTGGCGTTAAGAAATCGTTGGCAGCTATTGAACGCGGTGCTGGAACAAATATATAACAGTAACCTCGACAAGCGACACGTCGAATTGTACCCGATTGTGGTGAAGGCTTCTAAAGCGTTGACAAAATTGCATAGAGTCTTATTCAGCAGCATCTTCGGTGAACTGTCCTCATATAAACAGAAATATGAACTCGAAAAGAACGGCGATCAGAACGACTTGCTCGCCAAAAAATTGAACGATTCCCTCACACTGTTACATGAACTAATCAAGGAATTGATCATTCAGTCAACATGCCTCAGTACTATCGACGATATTGTCAGTCTATCGATCGAGTGGATCCGCCACGACAATGACGAAATCCGCACCGCAGCTACGCTAATACTTCAAGTGGTTTTCGATGCATACATTAAAAACGTCAAGCTCAACTACGAAACTCCAAGTAAATTCGGACAAATGGGATATCTTTTGGGCATCATCGTACCTGGAGTCGCCGACTCGAATTTCGCTGTCAGATTGACGACGATAGACTGCATCAAGCTCGTCATACAGATACAGGATCTGTACGAAGGGCACACCGTGAATCCCGAGGACGAATGCATAAATGGTCTATCGAAGTTACAGAATAACGTACTAACCAACGACTTGAACATGATAAGCCACTACTGCACCAATCTCTGTGATATGATTTCTCATAAAATACCGCATTTGCATAACATGCAGTTTGTGGAGAGTTTGTTGGACGGCTACGACGACCAGGAGTTCAGGAGCGTCGGCATCAGCTCTGTACTGGACGCCTTCATCGTCCAGAAAGGTCAAGACCTGTTCCAGAACATCGAGAGGATAGTAGAGGTCCTGTTGACCACCATGGACGAAGTCGGCCACGACGCTCGCATCAGACTCATGAGGCCCGTGACGTCACTCACGCGTCATCACTCCAATGCCGTGACAGCCGTGCTGTTGGGACAAAAGCTGCCTTTAATACCTTCCGTCATATCTTGCTGGCGTTATCTGGCCCGCGACGAGAGTCTGTCTTCGGCTATAGTCGATAACTTCTTGCGACTGATGACCTCCATCGAGCTTTACGAGGATCCTTATCACATCACGGAGAATCACGTGGCCGCCTTGCAGCCTTTGACCCTGATAAGCGCCCTCGGAGAGATGCTGCAGGAGTTATCGATGAGAGGAACGTGTCTCGCCAAGTTCCCGGAGCTGTTCTCAGTACTGTACACGACTCTGGCGTGTTACATAGAAGCCGAACCACCAGCGTACAGTCTGCCGCACAAGAACCAGGACCGTTTCGGATTCATACCCAACAGAGAAGCCATCAAGTTGTCGCCGGCTCGAATCACCATCAACACCTTCAACGTCTTCCTCGAGAGAGCTGACTGTTACAAGGTGAAGGAGGCGTGTTCTCTGTGTCTGAGCGTGGAGCACGGTGACAGCACCGCCACTCTGTTGGAGCTGGCTCCGATTCTGTCCGCGGCGATCAGTGCCTCACCCGCCCTGCCCAAGGTGGTGTCGCGGGTGGCGCTCTACTCGCGGTCACCCCTCCCGCCACAGAGATGTGCTGCACTCGCCTTATTGGCGGATCTCTTGAATTACAGATGTAACGACAACAGCGTGTTGATCGAGACGGTGTTGTCGTCGCTGTCCACCGGCTGGAAGGACGGGGACGGGCGAGTGAGGTCCGCGTGTCTACGGACCGCCGCCAACGTGTGCCGCCTGGCCCGGGAACACAGGGAACACGCCGTGCCCGCCGCACTCACCGCGCTCAGCCAGGGAATAGACGCGCACACCACCACGTCGACCGTAGACAACGTCCCCCTAGCTGCCATCCAAGGCCTGAGCCGACTGCTCATGGAGACCGAGGACATAGACAAGGAGTTCGAGAGAGAGCTGGTGGCCGTCTCTCAGAAGATACGACCCTTCATGAACACGGACTGCTGCCACCTGAGAGAGAGCTCCATCAGACTGTTCGGGCTGGTGGCGGGCAGGCTGCGGCGCGGCTCGCTGGGCGACCAGGCCGTGAGCTGCATACCCTGCTTCCTGCTACATCTGTGTGACACCAACCCCGCCGTCGTCAGGGCCAGTAAGTTTACCCTGCGCCAGGTGTTCAAGACGTTCAACGTGAAGAAATCCAACGACTTCGTCCAGGCGCACCTGGTGGACGAGGGCCGGCTCTATCTGGACGAGTTCCTGTCGGCTCTGCTGCGCCACCTGGCGGACGAGATGCCCGCCGCTATCGTCAAATGCATCGAGACGGCCGGCAACTACCTACAGGGCGCGAGAGACGAAATAAAACCACACGCACCCCTGTTGATAGGTATTCTATATTCCGAGCTGGCTCGCCTCCCGGCCTCTGAGCGCTCGTCCCTGGAGCGTGACGTCACACAGCGTGCCAGGACGCGCCTCCTGGCGCTCCTCACCGACAGCGACGCCAGAGTGAGACAGAACGCTGCCGCAGCACTCGCTAACATGTGTCTAGTGTAA

Protein sequence:

>DPOGS211841-PA
MSSSISEYLKDTVTVLINAIGDNDNTVNNVIIKSLVKISNVHPNEVIEIFCEFYRNTVKSNVIQLGNLVKVLEQTCVNQLKKIDQNVAKELINAMLRAMMENPMYEPVVQMGASSVLVAISHEYLDLVLRSLIDQISPASLPHYTIAHTLGTLAAVNTHELVPHVKEILGKMLPLLPLVKQDGSKQAFAYAFGHFAVAVSEQIGDNVANDNITSIKDNFVTEFTIVFDVLYNQWLPCHEPKVSESVLEALGPITRLISERQFNETVNKFVLSLLSLYRKPAINFYCISQCISYLLSPSPLNPKLTLNDNVINSINNVLFNLVLLEPDYDQPHTVKNHFEVLRCFDHMTTQFSDQTVESLLHHCKHNQEKDRMKAVIILTHLTTSSQIFIEHFSSKFITILKVMIVMEQGVKMKKLLVKAIVGLVYRNCIMSPEDFVMVEFIIKHCGYEGTPTTTKAEVLDLQDTCKSSLVLMCNTVTSVRSQLRNLLLNSLTVDEFTSSMSTVSHCLTSLLQNNSDVIPDEQSDRTERLKLNCSPDLVFVRCLIQIADPDRTETNKNVLVFLDEFSGDVHKNLKNKWTIEIQRLLKFVEKNESKEQWHGMLLDLLVSAIEEVNSNKWVETISTLVSQQIMGKKQSPMFKGVSLQYLAILSCHLSNAAVVETVLKIILLALKSIPMSSVDCVSKAVGIASRVHGECVLNELDAIYKENEARRGNKLLNFLSTRASKNELELSIVKYAVITCYGKVANECLDVHVLARLGENITSILYEILKSNPPFDLCKASVTTLYEISKALYPAAHHNVALRNRWQLLNAVLEQIYNSNLDKRHVELYPIVVKASKALTKLHRVLFSSIFGELSSYKQKYELEKNGDQNDLLAKKLNDSLTLLHELIKELIIQSTCLSTIDDIVSLSIEWIRHDNDEIRTAATLILQVVFDAYIKNVKLNYETPSKFGQMGYLLGIIVPGVADSNFAVRLTTIDCIKLVIQIQDLYEGHTVNPEDECINGLSKLQNNVLTNDLNMISHYCTNLCDMISHKIPHLHNMQFVESLLDGYDDQEFRSVGISSVLDAFIVQKGQDLFQNIERIVEVLLTTMDEVGHDARIRLMRPVTSLTRHHSNAVTAVLLGQKLPLIPSVISCWRYLARDESLSSAIVDNFLRLMTSIELYEDPYHITENHVAALQPLTLISALGEMLQELSMRGTCLAKFPELFSVLYTTLACYIEAEPPAYSLPHKNQDRFGFIPNREAIKLSPARITINTFNVFLERADCYKVKEACSLCLSVEHGDSTATLLELAPILSAAISASPALPKVVSRVALYSRSPLPPQRCAALALLADLLNYRCNDNSVLIETVLSSLSTGWKDGDGRVRSACLRTAANVCRLAREHREHAVPAALTALSQGIDAHTTTSTVDNVPLAAIQGLSRLLMETEDIDKEFERELVAVSQKIRPFMNTDCCHLRESSIRLFGLVAGRLRRGSLGDQAVSCIPCFLLHLCDTNPAVVRASKFTLRQVFKTFNVKKSNDFVQAHLVDEGRLYLDEFLSALLRHLADEMPAAIVKCIETAGNYLQGARDEIKPHAPLLIGILYSELARLPASERSSLERDVTQRARTRLLALLTDSDARVRQNAAAALANMCLV-