Monarch geneset OGS2.0

DPOGS210632
Transcript: DPOGS210632-TA; 1557 bp
Protein: DPOGS210632-PA; 518 aa
Genomic position: DPSCF300168 + 575279-581778
RNAseq coverage: 325x (Rank: top 35%)
Annotation
Heliconius: HMEL008297; 3e-151; 70.53%
Bombyx: BGIBMGA013583-TA; 0.0; 67.70%
Drosophila: Tango10-PB; 1e-98; 39.76%
EBI UniRef50: UniRef50_E2B6B0; 9e-135; 49.80%; BTB/POZ and BACK domain-containing protein LOC388419 n=11 Tax=Neoptera RepID=E2B6B0_HARSA
NCBI RefSeq: XP_624867.1; 1e-136; 48.74%; PREDICTED: similar to CG1841-PA, isoform A [Apis mellifera]
NCBI nr blastp: gi|383857909; 4e-136; 48.36%; PREDICTED: BTB/POZ domain-containing protein 17-like [Megachile rotundata]
NCBI nr blastx: gi|383857909; 2e-133; 48.36%; PREDICTED: BTB/POZ domain-containing protein 17-like [Megachile rotundata]
Group
Gene Ontology: GO:0005515; 2.1e-18; protein binding
KEGG pathway 
InterPro domain: [20-148] IPR011333; 2.1e-23; BTB/POZ fold
  [43-147] IPR013069; 2.1e-18; BTB/POZ
  [48-148] IPR000210; 1.7e-16; BTB/POZ-like
Orthology group: MCL11758; Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210632-TA
ATGGATTATTTAACAGTGGATTCTGCTTCTACATCCCTGGTTCAGAGCAAAGATGAAGAGAATGACAATGATGATCTTGAAGTGGACAACTCAAAAAGTGTTTTGCTCAAGATTGCCACACTTTATGCTGAACAGTTGATGAGCGACCTTATTCTGGAGGTGGCTGGTGTCAGCTATCCGGCTCATAGGCTCATTCTGTGTGCTAGTAGTGAAGTGTTCCAGATAATGTTAATGAATCGTGAATGGAGTGAGTGGCGTGAGAGTCGTATAGTTCTTCAAGAGACTCCATCAGCGGTGGCGGTGTTCCCACACTTCCTGAAGTACTTCTACACGGGACAAATCAAGATATCATACACATCGGTGTTGCCAGTACTGTCGCTAGCTGATAAATATAATGTTAAGGACCTTGTGAATCTGTGCTTGGAGTACATGTCCCAGCACATAGCTCAAGCTGCTCGACGTGGCCGCCTTATATCCTGGATGCAGTACACTATGGCATGGCCTGCGTGTGTCCGCTTCGTGAAATGGAACGTTGAATGGGTGGTGGAGGGAGAGCTTGGGGAGCTGGAAGACGACTCACTGCTGCTGTTGATGGACCAGAGCGACCTGGTGCTGCATAACGAGATGGCGCTCTACCAGTTGGTGGTAGCGACCAACTTATTCGTCTTCCTGCGTCTCCAGTCTACAGATGTACCGGAGCAAGATGTCAAGTTGCACTTTGACTCGCTCATAGTAACTGTTTTCTCACATGTCAGGTTCCCGATGATGTGTCCGAACCAATTGGCAAAGTTGTTGCTGTGTCCGCTCACTCAAGAACATAAGGAGTTTTTCATGGAGAGAATGGCGATCGCCATGAGTTACCAGTCAGGTCAGTACGAGCGTATAGCTGAAATCCAGCAGTCCGAGGCTGGTAGGATGTTGTTCACGCCCCGTCTCTACACTGAGGATATCTGGGGTTCCGTACTGGCGGTGGACAACTTCCACTCTCTTCCCTGTTATCACACCAGGACCTTCATATTCTCCACCAGACCCACCATCGCTGACGTCACAGACAAACTCACCGAGTGGACCGTGGACCTGTACCCTAAGGGCGTTTGGTTTAAGAAGAGCATGCTCATTATGTGGGCGGGCAATTATGATGTACCTGAGGTGGTGCTGCGCACAGTTCGCATCTCCATCACGTGTCAGAACGTCCCCGAGCGAGCCTCGCACGATCCCGACGTGAGGGTCAAGATAGGTATACTGGTGTGGGGAGTACAGAACGGCGTGGAGCACGTGGCCTCCGTGGTGGAGAGAGTGCACAGGTTCTCAGCGCAGAACAGGGTACTTAATATAGACGGTGCGCTGGACTTCGACGAGCTGAACAGTCCGCTGTACCGACCCGCCGCGCCGACGAACACGCCTAAGACTGGCGGCCAGCGGTGTCCGAAGTGTTCCGACAACTGTGAGGTGTCTCAGAAGACTCACCTGCTGGGACCGGCCGCCGATCAGCTGAGGATCCAAGTGGTGATAGTGCCCCTGACTGATTTCTGTGACGTCAGCGACACCCGCGGATGA

Protein sequence:

>DPOGS210632-PA
MDYLTVDSASTSLVQSKDEENDNDDLEVDNSKSVLLKIATLYAEQLMSDLILEVAGVSYPAHRLILCASSEVFQIMLMNREWSEWRESRIVLQETPSAVAVFPHFLKYFYTGQIKISYTSVLPVLSLADKYNVKDLVNLCLEYMSQHIAQAARRGRLISWMQYTMAWPACVRFVKWNVEWVVEGELGELEDDSLLLLMDQSDLVLHNEMALYQLVVATNLFVFLRLQSTDVPEQDVKLHFDSLIVTVFSHVRFPMMCPNQLAKLLLCPLTQEHKEFFMERMAIAMSYQSGQYERIAEIQQSEAGRMLFTPRLYTEDIWGSVLAVDNFHSLPCYHTRTFIFSTRPTIADVTDKLTEWTVDLYPKGVWFKKSMLIMWAGNYDVPEVVLRTVRISITCQNVPERASHDPDVRVKIGILVWGVQNGVEHVASVVERVHRFSAQNRVLNIDGALDFDELNSPLYRPAAPTNTPKTGGQRCPKCSDNCEVSQKTHLLGPAADQLRIQVVIVPLTDFCDVSDTRG-