Monarch geneset OGS2.0

DPOGS202706
Transcript: DPOGS202706-TA (3867 bp)
Protein: DPOGS202706-PA (1288 aa)
Genomic position: DPSCF300272 - 188418-194343
RNAseq coverage: 25x (Rank: top 77%)
Annotation
Heliconius: HMEL017542; 2e-144; 48.25%
Bombyx: BGIBMGA008386-TA; 3e-79; 56.78%
Drosophila: ptip-PA; 1e-08; 27.72%
EBI UniRef50: UniRef50_E0VLZ1; 1e-30; 39.34%; Cytoskeletal protein Sojo, putative n=1 Tax=Pediculus humanus corporis RepID=E0VLZ1_PEDHC
NCBI RefSeq: XP_002427135.1; 2e-31; 39.34%; Cytoskeletal protein Sojo, putative [Pediculus humanus corporis]
NCBI nr blastp: gi|242012841; 5e-30; 39.34%; Cytoskeletal protein Sojo, putative [Pediculus humanus corporis]
NCBI nr blastx: gi|270009477; 3e-46; 21.11%; hypothetical protein TcasGA2_TC008741 [Tribolium castaneum]
Group
Gene Ontology: GO:0005515; 5.6e-21; protein binding
    GO:0005622; 1.1e-18; intracellular
KEGG pathway 
InterPro domain: [34-127] IPR008984; 5.6e-21; SMAD/FHA domain
    [1110-1200] IPR001357; 1.1e-18; BRCT
    [33-120] IPR000253; 1.4e-17; Forkhead-associated (FHA) domain
Orthology group: MCL25474; Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS202706-TA
ATGGAGCTAACTCAAAAACTCGAGTGTACACAAGAAATATGTGCGAATAACTATGAAAAAGAATTTCCTGAACAGATTGGCTTTTTGGGTATTTGTGGTGTTAAGTACCCTTTGAAGAAAGGACCAAATAAGATAGGCAGAGATCCCGATACGTGCAATATTGTCCTTAATTTAAATTCTATATCGAGGCAACATGCGGTGATCAACATTTTAAACAGTTATGACTTCATGCTGATGGATTTAGATTCTGCAAACAAAACGAAACTAGCAGATAAAACATTACAAGCATATATTCCGCACCCAATTAAAAATGGAGATATGGTACAATTTGGCCAGGTGTTTGGTGTGTTCCGATTATTTGAGGAAGACAATGATCTACCCATGACCCAAGCACTGGATGTACCAGAAACTCCTGTGAGGAACCAGGCGGTTTATAAAGTTAATAACCTGATAACTACAATTCCAGAATCTCCAGATGTCAGTGATAGGGATGACTCCTTCATTGTGGCCTCACAACCCAAACCAAATAATGTATTCAAAAGTCCAAAGACCAACTTCATAAAATCATCTGGGAAAACAGTACATATAAAGCCTGTTGGTTTCAATAAAATAGATAACGCCTATTGGAGTTCATCTAAAAAGTCCGATTCATTCAGCTTAGAAACTGATACATCTAGAAATGATTCAGACATTTGTTTGAAAACTTCTGAAATTAATCAAAATATTCATGAACAAGACACCCAATGTACTGGTTACAATAAAACAGACAATTCCATTTATGAAGCTGAAACACAAATAGACAATGTACAAAAATCAGTTCATAACTTGGAAACACAACTACTTGGAATCAATAATTTGCCAGAAGTATTCAAAATACAAGGTCAAGAAAATATCAACTTAGATTTGTCTACAGAGAACAAAGAAAATGAGGATGTATATCAGTCTGACAAAGAGGTTTTATTACATAAAGCAATGGGTAGTGCTGTCAGAGATAATGAAAGTTGCAACAACAAAGATATATCTGATGATATAATATTATTTGATGAGATTGATAGTCCAGTAGGTGAAGATAACATTGAATCACAACAACTTCTCATACCTGAATTAGAAACGGAACATGTTGAAATAGTAGGAGACAGTGAAAAGCCTTCAAACAAGGAGGAGGAGCCATCAAAAAGAAGAGATTCTGGAAGCTCCACAGATTGTGAGGATATTTATTTCGCGCTAACACAGAATTTACCAGAAAAACGAATTGAAGATGATGAAACCGACTGTGAAGATGATCCTGAAATAATTGTTAAAGTAAATAAAGAACAAAAAACAGTGGAAAACTCTAACTTAGACGACGATCTAACAGACTGTGAAGATGATATTGAAAGCAAAGCCCAAACAGATCCCGGCTTAGAGGATTTAGCTACGCAAATTGTAGAAGATGTTAGCCATTGCAAAATAAACTCTTCGAAAACGACATCAGATGACCAACAGGGTCCAAATGAAATAGACCTGGAAGATATGCCCACACAGATAATTAGTTACGAACCGCAGAATATTTACTCTGAAGAAGTTGTCACTCCGTTTAAAGTTCCCGCTATATCTCCTATGAAACGTAAGAGAAAAGAACAGGAGCCAAAAGCACTCACAATTATTAATAATAAAAATATAACTGCCAACATAGAGAATGACGATGAGGAAAATTATTATGACGCAACGCAAAAATTATGGGACGATTTATGTACTCAAAGGGAATCATCCCCCACATTATCTTGTGGAGCCAATAAGGATGTATTAGAAACACAGATTAATAAAAACATTATTATTAATCCACCCGATTGTCAACCAGAGACATATGATATTGATGAAAAAATTGATAGATTTGTAATGAATTTGACTAATACGGAGAAAACTTCAATGGTCGGTGCGAAAACAGGCGTGGAATCCAAGAAGTCATCTAGTGATAGCTCTGATGTTGAAACAACACCGAGGAAGTTAAATCCTCTTTC
CTTTCTAAATACCGAGCTACCCAACACACAAGAAATAAAGTTGTATGTAAAGGCAACCGGTGATACCTCACCGGAAACCTCTTTTGACTTAGATACAGATAGCTCAGATTGTGAACGACATAGAAAGAAAAGAAATACAAAGAATAAAATTGTTAAAAGATCAAATCTGATCACAAGGTTAGAGTCTGAGAAATCACCTGAAAGAATAATAAATCCCGTGAGACAACCTGAGAATAAAATCAAGCACTCTAAAGACATCGATAACAAAAAACCTAAGAGAAATACTGGATCTGACTTGTTAGGAAATGAAAATAATAACGACGGAAAACTAGTCACAGCTAACACATCAGAAGTTAAAGAAAAATCGACAAGAAGCACGAGAAGTTCAGATATTATTGACAAAAAGGACAGTAAAGAATTAAATAAAAGCAAAGCAAGTTGTAAAAATACAAAGGAAAGCAAAGCAAAAGGAATAACAGATAGCAAAGAAACTGCGAGAAGGGGACGCAAAAATAATATTACTATAGAAAAACAAAACAGCATCGTCAAAAAAACCACTGAGGGCGATATTAAAAATAATGTCACAGTTACGGACAGAGAACAGGCAAAAGAAAAAAAGGGTAACAAAAAATCCACAGACACTAATGACAAAGAAAAACGTAACAGCAGTAGAAGTGCGAGTAAACAAACAAAATCAGATAATAACGAGCCCAAACTCCCACAAGCGAAAGGAGACTCAAACAGAAGTAGTCGCAGCAAAAGCAGGCGAAAAGAAACTAATACGATCGACAAATTTCTATGTCCCACTCCTGTTATTGAAGCTCAGAAGAAAAATGTTAGCAAAAGTAGAAGTACAGAAAAAGAAATCAAAAACACAGAGCAGGAAAACAATGGAAAAGATAGAGGAAACAGAAGAAAAGATAGCAAAAGTACAGACAACACAAGCAGAAATAGTGACAAGGTAAACAAAAACAAGGATATAGATACTAAAGAAAAGAACAGAACAAGTGCAGAGACGAGAAATCAAACAAGGAGTATGGAGAATAACAAAAAGGAAATACCGGTGGAATTAGAAGTTAGACGGAGCAAGAGGCAAAGAACGGCTAAAAGGAGCTTTGAGATAGAATCAAGGACATCGAAAACTAATCATGAACAGAGCACAGTGTATAATATCTCATCAGAATCTGGTATTGACTCTCCCAACAAGCTAAAGAGACAGGCTAGTGATATCAGTCTACCGAGCTCTAAGAAGACCAAAACGTGCAACGGTTCAAATATCACATTAAGGGCCACGCCTGCTAGAAAGATTAAGACGCAGTACGTACTCTTTACAGCATTCCCATGTGACGAGGTCAAAGTTAAATTGGAAAAGTTGGGGGCAGTTATTGTAACTGACATAATGAAATGTACAGTAGTCCTAACTTTAGAAATTAAAAGGACATTTAAGTTGCTCTGTGCTGTGGGTCTTGGTAAGCCGATAGTCGGACCTCACTGGGTTCAGGCTTGTGTTGACACTAATATGATTGTTGATCCTTGGTTATATCTCATCAAGGACGAGAAAACTGAGAAGCGTTTTCAGTTCAATCTGGAGCGTATATTAATTGGTAAGAGACAATTTCTGAAAGGCTACAACGTGTCGTCAACGCCAAATGTAATGCCAAGCCCTCCCGAGATGAAATTAATAGTGGAGTGTTCAGGCGGAACCTGGACAGCAGGCGGGAAAAATTGGATATGTGTGTCCTCTAACACTGACAGAGCTCTATGGGACGGGCTCAAGCGCAAGGGCGCCACTATAGTGTCAACGGAATTCGTTTTGGCGGGAGTTTTACGACAGAAAATAGATATCAATAGAAATATACTGTTGTGA

Protein sequence:

>DPOGS202706-PA
MELTQKLECTQEICANNYEKEFPEQIGFLGICGVKYPLKKGPNKIGRDPDTCNIVLNLNSISRQHAVINILNSYDFMLMDLDSANKTKLADKTLQAYIPHPIKNGDMVQFGQVFGVFRLFEEDNDLPMTQALDVPETPVRNQAVYKVNNLITTIPESPDVSDRDDSFIVASQPKPNNVFKSPKTNFIKSSGKTVHIKPVGFNKIDNAYWSSSKKSDSFSLETDTSRNDSDICLKTSEINQNIHEQDTQCTGYNKTDNSIYEAETQIDNVQKSVHNLETQLLGINNLPEVFKIQGQENINLDLSTENKENEDVYQSDKEVLLHKAMGSAVRDNESCNNKDISDDIILFDEIDSPVGEDNIESQQLLIPELETEHVEIVGDSEKPSNKEEEPSKRRDSGSSTDCEDIYFALTQNLPEKRIEDDETDCEDDPEIIVKVNKEQKTVENSNLDDDLTDCEDDIESKAQTDPGLEDLATQIVEDVSHCKINSSKTTSDDQQGPNEIDLEDMPTQIISYEPQNIYSEEVVTPFKVPAISPMKRKRKEQEPKALTIINNKNITANIENDDEENYYDATQKLWDDLCTQRESSPTLSCGANKDVLETQINKNIIINPPDCQPETYDIDEKIDRFVMNLTNTEKTSMVGAKTGVESKKSSSDSSDVETTPRKLNPLSFLNTELPNTQEIKLYVKATGDTSPETSFDLDTDSSDCERHRKKRNTKNKIVKRSNLITRLESEKSPERIINPVRQPENKIKHSKDIDNKKPKRNTGSDLLGNENNNDGKLVTANTSEVKEKSTRSTRSSDIIDKKDSKELNKSKASCKNTKESKAKGITDSKETARRGRKNNITIEKQNSIVKKTTEGDIKNNVTVTDREQAKEKKGNKKSTDTNDKEKRNSSRSASKQTKSDNNEPKLPQAKGDSNRSSRSKSRRKETNTIDKFLCPTPVIEAQKKNVSKSRSTEKEIKNTEQENNGKDRGNRRKDSKSTDNTSRNSDKVNKNKDIDTKEKNRTSAETRNQTRSMENNKKEIPVELEVRRSKRQRTAKRSFEIESRTSKTNHEQSTVYNISSESGIDSPNKLKRQASDISLPSSKKTKTCNGSNITLRATPARKIKTQYVLFTAFPCDEVKVKLEKLGAVIVTDIMKCTVVLTLEIKRTFKLLCAVGLGKPIVGPHWVQACVDTNMIVDPWLYLIKDEKTEKRFQFNLERILIGKRQFLKGYNVSSTPNVMPSPPEMKLIVECSGGTWTAGGKNWICVSSNTDRALWDGLKRKGATIVSTEFVLAGVLRQKIDINRNILL-
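
The transcript and protein lengths reported in this record (3867 bp and 1288 aa) can be cross-checked with simple arithmetic. The sketch below is illustrative only and is not part of the database record. It assumes the transcript consists entirely of coding sequence (no UTRs) and ends in a stop codon, which is consistent with the nucleotide sequence above starting with ATG and ending with TGA. The function name `expected_protein_length` is hypothetical.

```python
def expected_protein_length(transcript_bp: int, has_stop_codon: bool = True) -> int:
    """Return the number of amino acids encoded by a coding sequence.

    Assumes the whole transcript is coding sequence (no UTRs).
    """
    if transcript_bp % 3 != 0:
        raise ValueError("coding sequence length is not a multiple of 3")
    codons = transcript_bp // 3
    # The terminal stop codon does not encode an amino acid.
    return codons - 1 if has_stop_codon else codons


# 3867 bp / 3 = 1289 codons; minus one stop codon gives 1288 residues.
print(expected_protein_length(3867))
```

Under these assumptions the result agrees with the 1288 aa listed for DPOGS202706-PA. The trailing "-" in the protein sequence appears to mark the stop codon.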