Monarch geneset OGS2.0

DPOGS201452
TranscriptDPOGS201452-TA1701 bp
ProteinDPOGS201452-PA566 aa
Genomic positionDPSCF300006 - 490254-492226
RNAseq coverage725x (Rank: top 18%)
Annotation
HeliconiusHMEL0090950.091.70% 
BombyxBGIBMGA002702-TA0.088.20% 
Drosophilaken-PA3e-6047.74% 
EBI UniRef50UniRef50_F6ME180.088.03%Ken and barbie protein n=2 Tax=Obtectomera RepID=F6ME18_BOMMO
NCBI RefSeqXP_968018.22e-12142.63%PREDICTED: similar to dusky-like CG15013-PA [Tribolium castaneum]
NCBI nr blastpgi|3796989120.088.03%ken and barbie protein [Bombyx mori]
NCBI nr blastxgi|3796989120.088.56%ken and barbie protein [Bombyx mori]
Group
Gene OntologyGO:00055154.5e-12protein binding
GO:00036761.1e-09nucleic acid binding
GO:00082706.2e-05zinc ion binding
GO:00056226.2e-05intracellular
KEGG pathway 
InterPro domain[3-116] IPR0113332e-13BTB/POZ fold
[29-123] IPR0130694.5e-12BTB/POZ
[481-513] IPR0130871.1e-09Zinc finger, C2H2-type/integrase, DNA-binding
[31-129] IPR0002102.6e-08BTB/POZ-like
Orthology groupMCL16049 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS201452-TA
ATGGGCGAGGGCCTGCTGACACTCACGTACGGCAAGCACCACGCCTGCATCCTGAACGAGGTAGGAGCGGCGTGGCGCGGCGCTCGTTACTCCGACCTGGTACTGGTGTGTGACGACGCTGTGCCACTTGCCGCACACCGCATCGTCATGGCGGCAGCTAGCCCGTTAATAAGAAAAATCCTCGATGATACCCCGTCGATCGAGAACCCAGTCACAATTCACATGTCTGGCATTAGCAGCACCCTCATGCGACACCTCTTGGTTTTCTTATACAGTGGCCAAGCATACATAGAGTCTGGTGAAATAGATGACATGCAAGAACTGTTTGAGCTGCTAGAAATAAAGTCTGACGTGTGGAAATCAACTAAAGAAAGACAACAAGCAGAAGAAAGAAACAGATCTTGCGAGAGGCTCAAAGGAAAAAACCTTAAAACGGACATCAACAGCAACGAAAGTAGTAAACATGACGCACCATCAGGATCCTCACACTCAGAAAGTGAAAGAGTTACACCAGGGGCAGGTTATAGTAAAGATAAAGATCGAGGAGATTCTGAGGCCCTTAGTATAAAGAAGGAAAATATGTCAACGAGTAGCAACGATGAACGGGAAGACGATGATCATGATGGCGACAAGGGTGATAAAGAATGTGGCAGAGTATCCTCGCGTCGTAGAAGTTCATCAAATCCGGTCAATTTATCTTTAGCCAGAAATTCGGAAAACACAAATAGTGAACACGACGACTCACAAGACTTGGATGTGGAAACAGTTGGAATAAAGAATATTGGGCGGAGACGGTCGACATCTTCAAGTCAAGACGATCAACCACAAGAAAACGTATACAGAAGACAGCTGCTGACTGACAGATTAGGAAGAGGCATTGAGAGAGCAAAACGAAAATCACATTATTTAGAGCCTTCCGAACTGGACCTGAGAACTGTAAAATTAGAAGATTACGCTCACTTGAAATCAACAGAAAATGAATTAATGGCCCTCGTTCAAACTTCTCCTGAGAACTACGTTGTCACACCACACCGGAAAAGAAGACCTGGATTTCATAACTCACCTTCACAGAATCCTCCTTTTGTCTCATTTGCACCAAGCTATCTTGATGAAATGGCACAATTACGTTACGCACAAGGTCAAGGGGTTGCGGCAGGAGTAGCTCGACTTGGAGGCTTACATTCACTTAGTGTTTCTGCCCCTCCATTTCTACCTGAAAGAAGCGCTACTCCACCCGCAGTACCACATGAGGATGCACTTAAATATCGGCCTCCGAGCGCAGGACCTTGGGGACCTTGGCTTTGTCAACCACAAATAGGCGGAGACGACACACCATCCACTGAACACGAACAGGGTTCGAGTAAACAAGCTCCTGTTAGGGAATATCGCTGTGAATACTGTGGTAAACAATTCGGCATGTCATGGAATCTCAAGACTCATCTAAGAGTTCACACAGGCGAAAAACCATTTGCATGTAGACTATGTGTTGCTATGTTTAAACAAAAGGCCCATTTGTTGAAACATTTGTGTTCAGTACATCGGAATGTCATATCGTCAAGCGAAAATGACGGACGAACAAATACTCCTGGACGATTTAACTGCTGTTTCTGTCAGTTAACATTTGAAGCGATGCCAGAACTAATAAGGCATCTCTCTGGACCACACAACAGTTTACTACTCAGTAAAAATTTACATGATTGA

Protein sequence:

>DPOGS201452-PA
MGEGLLTLTYGKHHACILNEVGAAWRGARYSDLVLVCDDAVPLAAHRIVMAAASPLIRKILDDTPSIENPVTIHMSGISSTLMRHLLVFLYSGQAYIESGEIDDMQELFELLEIKSDVWKSTKERQQAEERNRSCERLKGKNLKTDINSNESSKHDAPSGSSHSESERVTPGAGYSKDKDRGDSEALSIKKENMSTSSNDEREDDDHDGDKGDKECGRVSSRRRSSSNPVNLSLARNSENTNSEHDDSQDLDVETVGIKNIGRRRSTSSSQDDQPQENVYRRQLLTDRLGRGIERAKRKSHYLEPSELDLRTVKLEDYAHLKSTENELMALVQTSPENYVVTPHRKRRPGFHNSPSQNPPFVSFAPSYLDEMAQLRYAQGQGVAAGVARLGGLHSLSVSAPPFLPERSATPPAVPHEDALKYRPPSAGPWGPWLCQPQIGGDDTPSTEHEQGSSKQAPVREYRCEYCGKQFGMSWNLKTHLRVHTGEKPFACRLCVAMFKQKAHLLKHLCSVHRNVISSSENDGRTNTPGRFNCCFCQLTFEAMPELIRHLSGPHNSLLLSKNLHD-