Monarch geneset OGS2.0

DPOGS209727
TranscriptDPOGS209727-TA1446 bp
ProteinDPOGS209727-PA481 aa
Genomic positionDPSCF300105 + 201994-205995
RNAseq coverage359x (Rank: top 33%)
Annotation
HeliconiusHMEL0077464e-9691.32% 
BombyxBGIBMGA008942-TA2e-9468.67% 
DrosophilaCG16778-PD2e-5069.42% 
EBI UniRef50UniRef50_D6WX423e-5269.92%Putative uncharacterized protein n=2 Tax=Tribolium castaneum RepID=D6WX42_TRICA
NCBI RefSeqXP_971045.15e-5359.54%PREDICTED: similar to Tyrosine kinase-related protein CG16778-PB [Tribolium castaneum]
NCBI nr blastpgi|910888499e-5259.54%PREDICTED: similar to Tyrosine kinase-related protein CG16778-PB [Tribolium castaneum]
NCBI nr blastxgi|910888491e-6041.85%PREDICTED: similar to Tyrosine kinase-related protein CG16778-PB [Tribolium castaneum]
Group
Gene OntologyGO:00055151e-19protein binding
KEGG pathway 
InterPro domain[223-336] IPR0113331.2e-26BTB/POZ fold
[242-338] IPR0130691e-19BTB/POZ
[251-346] IPR0002102.4e-18BTB/POZ-like
Orthology groupMCL25555 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS209727-TA
ATGACAAGTATTCATATTCATCGAGTCACAGTACAGGGGAAGTTCGAAACTAATCTAAGTGCCAAGCTCGTACAAGTTGGCAGACAAGTGCCGACGTGGCCTTTTTTTAGTTTTCAGCCTTCGCGATACACCCGCATCGACAGACTAGACATCGCTGGTGTCGACACAGCACTGCAGACACACCAGCGAGACACTAGACACTTGACACTTGACACGGGCGGGGCTGAGGGCGGGACCGCGCCGGCCGCGCCACTCGCACTCGCACTCGCCTCGCACCGGTCTCGGAGCGAACCGGGTCGACAGCGGAGTACCGCGCTCGTCGCTCAGCTCACTCCGCGCTCGCGAGCCGTCCGCGTAACACGTCCGCGTCACTCTGTCCCGGGACACCACACACCACACCGAGAGACAGACCGACCGACCGCCCGCCGCGGCCAGCCGACCGTCTACCGCCGTCGAGTGCTCTGTGCAGTGATTGCGAACTTTAACGTTTTGAGTGATGACAACGAGCTCCAGAGACTTCTGAGAGAAAGGGTCGGCGTGCAGGGCGGCGCGACAACGCGCACGCGCCCGCGGCCACGAGTCGCCGCGACCCGCACACGCGTCCAGACCGGCCAGTGGCCAGCGGCCACCCTCAAGACAGCCATGGACGACAAGCAGCCCGCCGGCGAGGAGCACTACAGTCTGCGCTGGAACAACCACCAGGCACATCTGCTCCGCTCCTTCGAGGCACTGCTGCACGCAGAAACACTGGTCGACGTCACCCTCGTGTGCGCCGAGAGGAGAGTGCGCGCCCACAAGGTGCTGTTAGGAGCGTGCAGCCCTCTCTTCAGGAGGATATTTAGCGAGAACCCGTGCAAACACCCGGTGATCGTGCTCAAAGACTTTCAAGGATGGGAGGTGCAAGCGGTTGTAGACTTCATGTACCGCGGAGAGGTTTCCGTAGCGCAGGAGCAGCTGGAGACGGTAATTCGCGCGGGAGAGTCGCTGCAGGTCCGCGGTCTGGCCGACCAGGAGGCTGCCGAGTGCGGGGAGTCGGAGAGGTCGCCGGTTGGAAGCCCTCCGACTGGGACAGCGGCCGCATCACCGCCCGCAAGCCCGCATAGCCCTCAACCTCGCCGCAAGCAGGCCCGCCCGAGAAGACGCTCCGGGGAGTCTGACACGCCGGAGAACCTGTCCATGAGACGATCACCGGGCGGCCTTAAGGCGGTGCGACTCGCCCGTGGCAGGTCTCCACACAAGCAGGAGCCCGAGGAGCCGGAGACGGAGGCCCCGCCGCCGCCACGTATGTTCCAGCCGCACCAAGACATGTTCCCTCCGGTGCCGCCGCCCGCTGTATCAGCCCTGTCTCTCACACCACCTCACAGTAAGTACCTCGGCATCATATCTGTAGCATTCACTAACAAACATTCCTCTCAACGACTCATCTCACATAGAACATATAAATAA

Protein sequence:

>DPOGS209727-PA
MTSIHIHRVTVQGKFETNLSAKLVQVGRQVPTWPFFSFQPSRYTRIDRLDIAGVDTALQTHQRDTRHLTLDTGGAEGGTAPAAPLALALASHRSRSEPGRQRSTALVAQLTPRSRAVRVTRPRHSVPGHHTPHRETDRPTARRGQPTVYRRRVLCAVIANFNVLSDDNELQRLLRERVGVQGGATTRTRPRPRVAATRTRVQTGQWPAATLKTAMDDKQPAGEEHYSLRWNNHQAHLLRSFEALLHAETLVDVTLVCAERRVRAHKVLLGACSPLFRRIFSENPCKHPVIVLKDFQGWEVQAVVDFMYRGEVSVAQEQLETVIRAGESLQVRGLADQEAAECGESERSPVGSPPTGTAAASPPASPHSPQPRRKQARPRRRSGESDTPENLSMRRSPGGLKAVRLARGRSPHKQEPEEPETEAPPPPRMFQPHQDMFPPVPPPAVSALSLTPPHSKYLGIISVAFTNKHSSQRLISHRTYK-