Monarch geneset OGS2.0

DPOGS207614
TranscriptDPOGS207614-TA1401 bp
ProteinDPOGS207614-PA466 aa
Genomic positionDPSCF300248 + 209149-212671
RNAseq coverage171x (Rank: top 50%)
Annotation
HeliconiusHMEL0078651e-13463.10% 
BombyxBGIBMGA006356-TA5e-12259.60% 
Drosophilamod(mdg4)-PE1e-3456.14% 
EBI UniRef50UniRef50_Q6IE028e-4559.85%Mod(Mdg4)-heS00531 n=1 Tax=Bombyx mori RepID=Q6IE02_BOMMO
NCBI RefSeqNP_001106229.11e-4559.85%Mod(mdg4)-heS00531 [Bombyx mori]
NCBI nr blastpgi|1638386923e-4459.85%Mod(mdg4)-heS00531 [Bombyx mori]
NCBI nr blastxgi|2700117762e-4341.43%hypothetical protein TcasGA2_TC005851 [Tribolium castaneum]
Group
Gene OntologyGO:00055155.6e-24protein binding
KEGG pathway 
InterPro domain[2-116] IPR0113332.7e-29BTB/POZ fold
[23-116] IPR0130695.6e-24BTB/POZ
[30-125] IPR0002102.5e-22BTB/POZ-like
[386-446] IPR0075881.5e-13Zinc finger, FLYWCH-type
Orthology groupMCL25230 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS207614-TA
ATGGACCAACAATATTCATTGTCTTGGAATAATTTCCACGGAAACCTGAGCAGGGGATTCGCCGGTTTATTAGGCAATGGTGAATTTGTTGATGTAACGATTGCTGTCGAAGGGCATTTGCTGCAAGCACATAAAGTTATTCTATCAATATGCTCACCATATTTCAAAAAAATGTTTCAATTGAATCCATGTCAGCATCCTATTGTAGTTTTAAGAGATGTCACTCATAAAGCCATGAGAGATTTATTACAATTCATGTATCATGGTGAAGTAAGTGTGAAAAGAGAAGATCTCACTAGTTTTATTGGTACAGCTGAAGTTCTGCAAATAAAAGGATTAACAACTAAAGAGACTGATGAAGAGGTGTTTGATACAGAAAAGGAATCAACAAAACAAAGCCTAGGACCTCAAAATGATAGTCCTGATGCTGAATCATACACTTCAGACATTTATGACCCAAACATCAATGAAGTAACAGACAACAATGAAGATTTAATTAAGAAAAGACAGTTATTAGAAAAAATACACAGGCTGAGTAGTTTGAAACGAAAATCTGATGAATTTTTACAAAAAAACTATTACAGTGATCCTATCACAGAGAAAAGAGCAAAAATTGCTGATAAAAAACTGTACAAAGTGAATAATCATTTGTTGCAAAGCGGCAATTATGAAAACAAGAATGTATATGATAACAATCCTACAGATGTAACTAATACATTAAATCAAATTCAAAATTCTGGTTCTGATAAGGAACTATCAGATGGTGGTCAAAATCCAGCTGCCGATAATACCATAGAATTAAAAACTGAGTGTGATGACAGCGATATCACAGTCATTGACCCGGCTGTTTGCACAACGAAAGACTTCAGTTGTTCCAGAGATGGTAAAAAAGGATTATCAATACCATTGGAATATCAATCACCGCAGGAAAATGAACCGGTACACGGTTTGAACGCATTTTTTATGGATCTCAAAAACTTAGTTAATGGCAAAATTAATAACACTTTGAGGAGTGAGGAATTTCCGAGGCATTCCCTAGAACATCGTCTTATTACAATACCACAAAAGGAAGATGGGAAAAAGAAATATCCCCAAAGATCGTATGCTCTGTCACTTGTCATGAATGGATCTTGCGAACTGCCAGTACGATACAAATATATATACAGCAGGAAAGGACACAAACAACTCGTACATATGAACTTTGTGTATACCAAGCATTCAACAACACACGGTAAAACGTCATGGAGATGCGTTCAATACTTCTCACTGAATAGATGTCCGGCGACGGTGGAAACTATTGATTCCATGATATACGCCGTCAATCACCAACATAACCATGAAGATTGCTATGAAAAATTATCGAGAAATAACATATATGAAATGAATGTATCAGCCAAATAA

Protein sequence:

>DPOGS207614-PA
MDQQYSLSWNNFHGNLSRGFAGLLGNGEFVDVTIAVEGHLLQAHKVILSICSPYFKKMFQLNPCQHPIVVLRDVTHKAMRDLLQFMYHGEVSVKREDLTSFIGTAEVLQIKGLTTKETDEEVFDTEKESTKQSLGPQNDSPDAESYTSDIYDPNINEVTDNNEDLIKKRQLLEKIHRLSSLKRKSDEFLQKNYYSDPITEKRAKIADKKLYKVNNHLLQSGNYENKNVYDNNPTDVTNTLNQIQNSGSDKELSDGGQNPAADNTIELKTECDDSDITVIDPAVCTTKDFSCSRDGKKGLSIPLEYQSPQENEPVHGLNAFFMDLKNLVNGKINNTLRSEEFPRHSLEHRLITIPQKEDGKKKYPQRSYALSLVMNGSCELPVRYKYIYSRKGHKQLVHMNFVYTKHSTTHGKTSWRCVQYFSLNRCPATVETIDSMIYAVNHQHNHEDCYEKLSRNNIYEMNVSAK-