Monarch geneset OGS2.0

DPOGS208685
Transcript        DPOGS208685-TA   1827 bp
Protein           DPOGS208685-PA   608 aa
Genomic position  DPSCF300043 - 557796-559969
RNAseq coverage   31x (Rank: top 75%)
Annotation
Heliconius        HMEL015209        0.0     84.53%
Bombyx            BGIBMGA003334-TA  0.0     79.32%
Drosophila        hb-PB             7e-72   56.68%
EBI UniRef50      UniRef50_O18326   7e-173  81.72%  Protein hunchback (Fragment) n=1 Tax=Bombyx mori RepID=HUNB_BOMMO
NCBI RefSeq       XP_001863240.1    2e-109  40.62%  hunchback [Culex quinquefasciatus]
NCBI nr blastp    gi|11132763       3e-172  81.72%  unnamed protein product [Bombyx mori]
NCBI nr blastx    gi|11132763       0.0     81.72%  unnamed protein product [Bombyx mori]
Group
Gene Ontology     GO:0003676  4.6e-07  nucleic acid binding
KEGG pathway
InterPro domain   [260-300] IPR013087  4.6e-07  Zinc finger, C2H2-type/integrase, DNA-binding
Orthology group   MCL15700  Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS208685-TA
ATGCTGAGCTGTGCATCACCCAGTATGGCCGCGCCGCACGCGCAGCCCTGGGGCTCACTCTTACAACAACCCATTAAATCGGAACCGATGGAAGACGGAAGTTTCTCAAAGGAACAAGCGAGCGGTTTTTACTCAGAAGGCTTTCATAGCGCATCGCCTTCGTCATCCAGCAAGGACTCCAATGGGCACTCGCCGCGCAGCGTCGGTAGTTCCGGAGAACCGTCCCCTTTCTACGATAATATGCCTTTAAAAGCAAAGGCTAACCTCGGGATGCATTTGGAATCATACCGTAACGGCTTGCCCTACAGTCTGCTCACGCCGCCTGGGTTTGAAAACCGACATAACGAAGAACACGAACAGTCGCCATACAGCTCGTATTCCCCCCGGTCTATAGCACCACCGGCTCACGTTTCCACACCTTTAGCCCGCGCTGATGCCACACCGCCGAAGTCTCCGCCGCAAACTCCATCATCCCCTCTTAGAGAACATGAAAGAAGAGCATTCGAAAGATTCCACGACTCTGGTTTCGACGGCATCGGACAAAATAAATCTGATGCTGACGACGGTCGGGATGGATCTGGACTCGAAGAAGATTTTGATGAAGAGCCCGGCCTACGCGTGCCGGCGGTCAACTCGCACGGAAAAGTCAAAACATTCAAATGCAAACAATGCGAATTTGTTGCTGTTACAAAATTGAGTTTCTGGGAGCACAGCAAAGAGCACATCAAGCCCGAAAAAATGCTTACGTGTAGAAAGTGCCCATTCGTCACTGAATACAAACACCATCTCGAATATCACATGAGAAACCATTTAGGCTCCAAGCCCTTCCAATGTTCTCAGTGCTCTTACTCTTGCGTCAACAAATCTATGTTAAATTCACACCTGAAATCTCATTCAAACATCTACCAATACAGATGCGCTGATTGTAACTATGCTACCAAATACTGTCATTCTCTCAAACTTCACCTACGGAAGTATAAACACAACCCAGCGATGGTGCTGAACATGGATGGTACACCGAACCCTTTGCCAATAATCGATGTGTACGGAACACGACGTGGTCCTAAACAAAAGCCATTAATGAAGATGTACGATCAGCAGCAGATGAATAACAAGCCACAACCTCTCCCACCCCAGCATCCAATTTTCGGAAATCACTTCCCGGTGAATCTGCCATACTTACCGCCACTTCTGCCACACTCGTTCTTGTTTCCGCCAAATAATAATTACGAACAGAGGACGTCGCCTAAAGTGACTGAAACATCGGTTGAAAATCAGCCATCTACTTCACCTCAATCGATATTACAACAGCGCTTGTCTTATGGCGAATATCCTTCGGAAGCAGGTGCCACGCCACCACCAACTAAATCACCCACAATCTTACCACAAACCCCCACAAAACGTACGCTGACACCACCTCAAACCACTGACGCTCTTGACTTGACAAATACCAAAACGAGCGAGGCAGGATCGCCTCCGCCCATAGAACCACCAGCGCCTGTCACGCCCACAACGGCCTTGAAGAACAGAAGAAAAGGAAGAGCATTCAAACTCCAACCAGCAGCTTTGAGATTACAGCATGAAGATACTAAAATGGAGGCGGACAACTCGGATTCGGAATCCGACGCTTCAGCTGAACCAACACCGAGTGCGCCAACATCGTACACCTGCCAATACTGCGACATAACGTTCGGGGATCTCACCATGCACACCATACACATGGGTTTCCATGGATACAACGATCCCTTCATGTGTAACAAATGCGGCGAAAGAAGCTCCGACCGCATAGCTTTCTTCATACACTTAGGACGCGCCCAGCATGCCTAA

Protein sequence:

>DPOGS208685-PA
MLSCASPSMAAPHAQPWGSLLQQPIKSEPMEDGSFSKEQASGFYSEGFHSASPSSSSKDSNGHSPRSVGSSGEPSPFYDNMPLKAKANLGMHLESYRNGLPYSLLTPPGFENRHNEEHEQSPYSSYSPRSIAPPAHVSTPLARADATPPKSPPQTPSSPLREHERRAFERFHDSGFDGIGQNKSDADDGRDGSGLEEDFDEEPGLRVPAVNSHGKVKTFKCKQCEFVAVTKLSFWEHSKEHIKPEKMLTCRKCPFVTEYKHHLEYHMRNHLGSKPFQCSQCSYSCVNKSMLNSHLKSHSNIYQYRCADCNYATKYCHSLKLHLRKYKHNPAMVLNMDGTPNPLPIIDVYGTRRGPKQKPLMKMYDQQQMNNKPQPLPPQHPIFGNHFPVNLPYLPPLLPHSFLFPPNNNYEQRTSPKVTETSVENQPSTSPQSILQQRLSYGEYPSEAGATPPPTKSPTILPQTPTKRTLTPPQTTDALDLTNTKTSEAGSPPPIEPPAPVTPTTALKNRRKGRAFKLQPAALRLQHEDTKMEADNSDSESDASAEPTPSAPTSYTCQYCDITFGDLTMHTIHMGFHGYNDPFMCNKCGERSSDRIAFFIHLGRAQHA-
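
The lengths listed in this record appear consistent with a complete coding sequence: 608 residues plus one stop codon gives 609 codons, or 1827 bp. The nucleotide sequence starts with ATG and ends with TAA, and the trailing "-" in the protein sequence presumably marks that stop. The short Python sketch below illustrates the check. The function names are hypothetical and not part of any database tool, and it assumes the transcript entry holds only the coding sequence, with no UTRs.

```python
# Standard stop codons; an assumption that the standard genetic code applies.
STOP_CODONS = {"TAA", "TAG", "TGA"}


def cds_length_matches(transcript_bp: int, protein_aa: int, has_stop: bool = True) -> bool:
    """Return True if a CDS of transcript_bp bases can encode protein_aa residues.

    Assumes the transcript is coding sequence only (hypothetical helper).
    """
    if transcript_bp % 3 != 0:
        return False
    codons = transcript_bp // 3
    expected = protein_aa + (1 if has_stop else 0)
    return codons == expected


def looks_like_complete_cds(seq: str) -> bool:
    """Return True if seq starts with ATG, ends with a stop codon,
    and has a length divisible by three."""
    seq = seq.upper()
    return (
        len(seq) >= 6
        and len(seq) % 3 == 0
        and seq.startswith("ATG")
        and seq[-3:] in STOP_CODONS
    )


# Lengths taken from the record: 1827 bp transcript, 608 aa protein.
print(cds_length_matches(1827, 608))
```

The same two helpers can be applied to the full FASTA entries by passing `len(nucleotide_sequence)` and the protein length with the trailing "-" removed.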