Monarch geneset OGS2.0

DPOGS201463
TranscriptDPOGS201463-TA1827 bp
ProteinDPOGS201463-PA608 aa
Genomic positionDPSCF300006 - 250803-257048
RNAseq coverage61x (Rank: top 68%)
Annotation
HeliconiusHMEL0159500.086.03% 
BombyxBGIBMGA002615-TA0.083.20% 
Drosophiladve-PA2e-8745.12% 
EBI UniRef50UniRef50_D6W8972e-17253.32%Putative uncharacterized protein n=1 Tax=Tribolium castaneum RepID=D6W897_TRICA
NCBI RefSeqXP_967929.13e-17353.32%PREDICTED: similar to defective proventriculus CG5799-PA [Tribolium castaneum]
NCBI nr blastpgi|910921646e-17253.32%PREDICTED: similar to defective proventriculus CG5799-PA [Tribolium castaneum]
NCBI nr blastxgi|910921642e-16953.41%PREDICTED: similar to defective proventriculus CG5799-PA [Tribolium castaneum]
Group
Gene OntologyGO:00055159.3e-16protein binding
GO:00036772.1e-11DNA binding
GO:00063552.1e-11regulation of transcription, DNA-dependent
GO:00435652.4e-08sequence-specific DNA binding
GO:00037002.4e-08sequence-specific DNA binding transcription factor activity
KEGG pathway 
InterPro domain[512-589] IPR0090579.3e-16Homeodomain-like
[230-309] IPR0122872.1e-11Homeodomain-related
[513-579] IPR0013562.4e-08Homeobox
Orthology groupMCL16051 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS201463-TA
ATGGAGTATCAACAAAATATTAATAAAGTTGGTAGCAAAGCTGGGGATCCAAGTAAAACCTTGCCTGTCCATTGTGTGGTGGAAGCCGTTGCGTCATTGGAAGAAGGTGTTTGGAGGCGAAGGGCAGTTGTTGAAACTGACAGCTATGTCATCATTCCCGCTGCTACCGCCTTCCATGAACTCGTGCCAGCGGCCATGATGCGACTTGGATACCCCCACGAGCTAGCTGCTTCTGCGAAAGGTTCAGTGGTAATTAATAACTGGAAGCCGTTGCCGTTCGAGCGCATATCCGATGGGCCTTTAGCCACTGTCGGTGAAGTGTTGGGTGAGTTGACGACCGTGGCCACCCTTAGGATCCAACTGTTACGGCCGAGACCCACACCCTTGCAGGATATCAAGGACAAACTCCTGAGACTTTTGCTACTCCAGAGCAGACCCTTGTTGATGTCGACTGGATGTCCTTTAGATGAGGTAACGTTAACTCAGATTTGTCGTGGACAAGATAGGTCTTCGGGTCCTCATAATTTTCATGAACCGACAGAGGAAACTCGTCGTAAATTTGAATCTTGGTGGAGTGCCCAAGTTTCACCTCGCCCCCCACCATTCAGTCCTCGGCGCTACCCATCTCCAGGTCCTAAAAATCGATCCCCTACACTTAACACCATCCCAGACCATCTGCACCCAGCACTACAGACCGTTCAAAACCAATATCCAACACAAAAAACTAGAATGAGAACAAGTTTTGATCCCGAACTAGAACTACCAAAGTTACAACGATGGTTCTCTGAAAATCAACACCCTAGTAGGCAGCAAATCCAACAATATGTCAGAGAGTTAAATAATTTAGAATCAAGACGGGGACGAAAGCCCTTAGACGTCAATAATGTCGTTTACTGGTTTAAAAATGCAAGAGCTGCTCAAAAACGGGCTGAGTTACGTAATATTGGAGGGATAGGGGGACATCTTGGTGTCAACGGCTTTAATAGCAGGAGTCATAGTCCATCGAATGGATCACTAATGGCTGGTAATGATAACTATAGTTCTCATGACCATAATTCTTTGAAGAGTCCCATGCAATTATCAGGAAGTCCTGGTAGATACCCAATGTCAGTTATGTCTGAAGACAATCTTTCAAACCCTGGATCTGATTTGGAAGATGACGGAGTCCATGATATCAAACAAGAGCCAAAGGACTTAAGCAAACAAGAACAAGTGCACTCACCTCAGCGTTCACCGACCAAAAATAGTGACAGCTCTCACAATAATAATAATAACAATGAAGATGAAAATGGAGGCGCCGAAGATCATGACATTCCTTCGGATGAAGAAGTCGTTCAAGAGCGTCATTACAGACCATCGTCTCCTCATCTCGACCGTTTACCGTTTCCAATGGTACCGAATCATCCCATGTTCGGTCACGGTATAATGTACATGAGCCAATACATGGGAGGATTCCCAGGTGTAGGGGGTGTTCCAGGTGAAGGGGCTAGCGGCTTAAATTTAGCCCTAGCAGGCGCGTCTGACGAGCGCCGCAAACGCAATCGTACCTTCATAGACCCCGTCTCTGAGGTTCCCGTGTTAGAGCAGTGGTTTTCAATGAACACACATCCTTCGCACAATCTCATACTTAAATATACAGAAGAGTTGAACAGGATGCCATATAGGCAAAAATTTCCACGACTGGAATCTAAAAATGTTCAGTTCTGGTTCAAGAACCGTCGGGCTAAGTGCAAGAGGCTGAAGATGTCTCTTTACGAGCCGACTTCACCTGGTCATTACTCCCATCCCGGTCATCCACACGCAATTGCTGAAAGAAAATTGGTGTAA

Protein sequence:

>DPOGS201463-PA
MEYQQNINKVGSKAGDPSKTLPVHCVVEAVASLEEGVWRRRAVVETDSYVIIPAATAFHELVPAAMMRLGYPHELAASAKGSVVINNWKPLPFERISDGPLATVGEVLGELTTVATLRIQLLRPRPTPLQDIKDKLLRLLLLQSRPLLMSTGCPLDEVTLTQICRGQDRSSGPHNFHEPTEETRRKFESWWSAQVSPRPPPFSPRRYPSPGPKNRSPTLNTIPDHLHPALQTVQNQYPTQKTRMRTSFDPELELPKLQRWFSENQHPSRQQIQQYVRELNNLESRRGRKPLDVNNVVYWFKNARAAQKRAELRNIGGIGGHLGVNGFNSRSHSPSNGSLMAGNDNYSSHDHNSLKSPMQLSGSPGRYPMSVMSEDNLSNPGSDLEDDGVHDIKQEPKDLSKQEQVHSPQRSPTKNSDSSHNNNNNNEDENGGAEDHDIPSDEEVVQERHYRPSSPHLDRLPFPMVPNHPMFGHGIMYMSQYMGGFPGVGGVPGEGASGLNLALAGASDERRKRNRTFIDPVSEVPVLEQWFSMNTHPSHNLILKYTEELNRMPYRQKFPRLESKNVQFWFKNRRAKCKRLKMSLYEPTSPGHYSHPGHPHAIAERKLV-