Monarch geneset OGS2.0

DPOGS210911
TranscriptDPOGS210911-TA1371 bp
ProteinDPOGS210911-PA456 aa
Genomic positionDPSCF300045 - 22193-33493
RNAseq coverage72x (Rank: top 66%)
Annotation
HeliconiusHMEL0158120.084.34% 
BombyxBGIBMGA003094-TA4e-14581.85% 
Drosophilachinmo-PE2e-4565.65% 
EBI UniRef50UniRef50_D6WJ476e-7942.29%Chronologically inappropriate morphogenesis n=4 Tax=Endopterygota RepID=D6WJ47_TRICA
NCBI RefSeqXP_970312.26e-7942.06%PREDICTED: similar to bmp-induced factor [Tribolium castaneum]
NCBI nr blastpgi|2700081832e-7842.29%chronologically inappropriate morphogenesis [Tribolium castaneum]
NCBI nr blastxgi|2700081832e-8043.27%chronologically inappropriate morphogenesis [Tribolium castaneum]
Group
Gene OntologyGO:00055155.2e-20protein binding
KEGG pathwaydme:Dmel_CG114912e-26 
 K02174 (BR-C)maps-> Dorso-ventral axis formation
InterPro domain[30-146] IPR0113331.3e-25BTB/POZ fold
[50-148] IPR0130695.2e-20BTB/POZ
[58-156] IPR0002108.2e-18BTB/POZ-like
Orthology groupMCL17160 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210911-TA
ATGGTTCGAAATTATATAGACCGGTCAGCTTTACGGTGTTCTCGAAAGAGGGCAGGTATACGAGGCGGCCCGGGCATGGACTCCCAACAGCAGCAGTACTGCCTGAAATGGAACAGCTTCGGCTCCAACCTGGCCACCTCGTTCGCGAACCTCTGGAACTCAGAGAGCCTCGCAGACGTGACTCTCTATTGCGAAGGTCGTCAGTTTAAAGCACATAAAGTTATCCTGGCAGCATGCAGCAAACACTTCCAGGAGTTATTCGACACTGCACCGCCAAGTCATGCTGGCGCCTGCTATGTATTTCTCGAAGCTACCACCGCAGACAACATGCAAGCTCTCCTCGAGTTCATGTACAAGGGAGAAGTCCATGTCAGTCAAGACGCTCTCTCCAGCTTCTTGAAAAGTGGAGAGAACTTACAGGTTAAAGGTTTATCTATGGAGATGTCCCAAGATGCTTGGGTGAAACAACAAACTCAGCAATCATCTGAACGTCATCAGACTCGCATCAAGACAAGCCCCGTATCTGGTTCAGGGAACTGTGACATTCAGGACGCAGCTCCCCCTCAGACTCACGGAGCTACTTTTGCACCAATTGGCATGCCTCAATATGGCATGAATCTACATAACAGCAGCGGTAGTATGAGTCGCTATGCTCCACCAGCTCATGTGCCCATACATTCGTCTACACACCGTAGACCACCGCCACCAAAACCATCACCGTCTCAAAGTCCACATGCTAGGAATTACAGACAGAGTTCATCATCTTCTCATTGCGGCAGTGTCGCTGATGAGCCCGACACTCGTTCATCACCAGGCGCTGGCGCTAGATTTGAAGAAACTCCTCCGGACTGTACTTCTAGTCTTGCTAACCCTGTGAAGAACAACGGATATGAACGAGCTGCTTCTGTAAATGATGAAATGATTGAACGCCTCTCAAATGATCAAGGAGCTGAAGATTTGCGGGTTAAAAATGAAAATGATTACCATGCTAGCAGTTATAACAATTCACCACCGCCCGCTATTCCGATACCGACCAGCAACTACCACGAGAAGCCACATGAGACCTCTCCAGTACCACCTAAACCTAGCCCCGACATATGGCCCGCGAAACTGATTACGAGCAAATCCGGTGGAATTGCTACTGCTGATGGTAAAAAATTAAAGTGTCCATACTGCGAGCGATTATATGGATATGAGACAAATTTGCGCGCTCACATCCGACAGCGGCATCAGGGGATCCGCGTGCCATGCCCTCATTGCACGCGCACATTTACGCGCAACAACACCGTGCGCCGCCACATTGCACGGGAGCACCGTCACCAGGTCACTCCTCATCTACCACAAGGCCCTCTCCCAAATCACACGCAGTAA

Protein sequence:

>DPOGS210911-PA
MVRNYIDRSALRCSRKRAGIRGGPGMDSQQQQYCLKWNSFGSNLATSFANLWNSESLADVTLYCEGRQFKAHKVILAACSKHFQELFDTAPPSHAGACYVFLEATTADNMQALLEFMYKGEVHVSQDALSSFLKSGENLQVKGLSMEMSQDAWVKQQTQQSSERHQTRIKTSPVSGSGNCDIQDAAPPQTHGATFAPIGMPQYGMNLHNSSGSMSRYAPPAHVPIHSSTHRRPPPPKPSPSQSPHARNYRQSSSSSHCGSVADEPDTRSSPGAGARFEETPPDCTSSLANPVKNNGYERAASVNDEMIERLSNDQGAEDLRVKNENDYHASSYNNSPPPAIPIPTSNYHEKPHETSPVPPKPSPDIWPAKLITSKSGGIATADGKKLKCPYCERLYGYETNLRAHIRQRHQGIRVPCPHCTRTFTRNNTVRRHIAREHRHQVTPHLPQGPLPNHTQ-