Monarch geneset OGS2.0

DPOGS215314
Transcript         DPOGS215314-TA  951 bp
Protein            DPOGS215314-PA  316 aa
Genomic position   DPSCF300120 - 146021-147422
RNAseq coverage    3x (Rank: top 91%)
Annotation
Heliconius         HMEL015396        4e-142  83.45%
Bombyx             BGIBMGA007623-TA  1e-102  67.93%
Drosophila         CG11085-PA        2e-37   93.51%
EBI UniRef50       UniRef50_E2BTC8   7e-38   79.21%  BarH-like 2 homeobox protein (Fragment) n=3 Tax=Harpegnathos saltator RepID=E2BTC8_HARSA
NCBI RefSeq        XP_396835.3       5e-38   76.15%  PREDICTED: similar to CG11085-PA [Apis mellifera]
NCBI nr blastp     gi|307201138      2e-37   79.21%  BarH-like 2 homeobox protein [Harpegnathos saltator]
NCBI nr blastx     gi|307201138      4e-36   75.23%  BarH-like 2 homeobox protein [Harpegnathos saltator]
Group
Gene Ontology      GO:0003677  6.7e-24  DNA binding
                   GO:0006355  6.7e-24  regulation of transcription, DNA-dependent
                   GO:0043565  2.6e-23  sequence-specific DNA binding
                   GO:0003700  2.6e-23  sequence-specific DNA binding transcription factor activity
                   GO:0005515  1.2e-21  protein binding
KEGG pathway 
InterPro domain    [190-276]  IPR012287  6.7e-24  Homeodomain-related
                   [216-278]  IPR001356  2.6e-23  Homeobox
                   [207-285]  IPR009057  1.2e-21  Homeodomain-like
Orthology group    MCL25371  Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS215314-TA
ATGGTCGTTTCTTTTTTAAAACTGGCGTCTTTATGTGGTCCGCTTTTGTACACCTTACGTAACGCAAATGATGGGGAAGAATTATCTTCAAACAGGTGCGATTCCCCAATAGGGCATAATCCGGAGGAATCCGATATAGATGATGATATATCGGTCGGAGACAATCGTTCAGAGACATCTACTCCCAAAAAGATTCAAGTTAATTATGACAATGACATCGAAAACTACAAGAGAAAAAATGACAATTGTACCAAAGGAGTTGAATCCGATCACGAACGATCAACGCCAACAGAATTTAATTCAAACCGTATATCCAATTCTGAATTCTCTTCACCATATAAAGAGGATGACCATGAATATGAGAGACAAGAAGAGTCCATGAAGTTGTATGAGGGATCCCTGTTTCAATTATATAGACCGGAAAGAGAAGGGAACAAGGAGGGTGAGAGCGCATTTGAAATCGCTGGCTCTATTTCGGACAGCATCTACGGAAGATCAATGGACCTAACCAAAAACTCATTTCAATCGCAGCTGCTCGCTGGTTTTGCTTCCGTTATGGCGGGTAGTTCACAGAGGGATTCACAGGATCCAACAGATCGTCAGTCCTCGGGTCGAAATGGAAGTTCCTCGAACTCCGGTCGCAAGCCTCGTCGGCGTCGCACAGCATTCACACACGCACAGTTGGCCTACTTGGAGCGCAAGTTTAGGTGTCAGAAATACTTGAGCGTCGCTGACCGAGGGGACGTCGCTGATGCTCTCAACCTCAGTGAAACACAAGTTAAAACGTGGTATCAGAACAGAAGAACAAAATGGAAGAGACAAAACCAGCTTCGTCTCGAGCAGCTTCGGGCGCAAGCGGCTAGCGGCGAACGCGAGCTGTCTGCGCACGCGCTGCCTCTAGCGTGCGCGCTGCTGCCGCCCTACCCCGCTTATATCCACTGTCACCTATGA

Protein sequence:

>DPOGS215314-PA
MVVSFLKLASLCGPLLYTLRNANDGEELSSNRCDSPIGHNPEESDIDDDISVGDNRSETSTPKKIQVNYDNDIENYKRKNDNCTKGVESDHERSTPTEFNSNRISNSEFSSPYKEDDHEYERQEESMKLYEGSLFQLYRPEREGNKEGESAFEIAGSISDSIYGRSMDLTKNSFQSQLLAGFASVMAGSSQRDSQDPTDRQSSGRNGSSSNSGRKPRRRRTAFTHAQLAYLERKFRCQKYLSVADRGDVADALNLSETQVKTWYQNRRTKWKRQNQLRLEQLRAQAASGERELSAHALPLACALLPPYPAYIHCHL-