Monarch geneset OGS2.0

DPOGS211731
TranscriptDPOGS211731-TA1116 bp
ProteinDPOGS211731-PA371 aa
Genomic positionDPSCF300239 + 157707-178589
RNAseq coverage205x (Rank: top 47%)
Annotation
HeliconiusHMEL0173428e-2360.66% 
BombyxBGIBMGA013977-TA2e-6888.74% 
Drosophilahomer-PC4e-9451.89% 
EBI UniRef50UniRef50_E1ZYA82e-10252.49%Homer protein-like protein 1 n=8 Tax=Formicidae RepID=E1ZYA8_CAMFO
NCBI RefSeqXP_002428225.16e-11156.89%homer, putative [Pediculus humanus corporis]
NCBI nr blastpgi|2420151271e-10956.89%homer, putative [Pediculus humanus corporis]
NCBI nr blastxgi|2700108972e-11058.73%hypothetical protein TcasGA2_TC015941 [Tribolium castaneum]
Group
Gene OntologyGO:00055158.1e-49protein binding
KEGG pathway 
InterPro domain[8-120] IPR0119938.1e-49Pleckstrin homology-type
[7-112] IPR0006971.9e-34EVH1
Orthology groupMCL12579 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS211731-TA
ATGGACTCTCAAACGGAAAAAACAGAGCAGCCGATCTTCACGTGCAAGGCCCACGTGTTCCACATAGACCCGAAGACGAAGAGATCTTGGATGTCAGCTAGTTCAGCGGCTGTTTCGGTGTCATTCTTCTACGATTCATCCCGTAATCTGTACCGGATCATCAGCGTTGAGGGCACCAAGGCGGTGATAAACAGCACGATCACAGCCAACATGACCTTCACTAAAACCTCCCAGAAGTTTGGACAGTGGAGCGACGTGCGAGCCAACACCGTATACGGGCTGGGATTTGCATCGGAGGCGGAATTAGGAAAATTCATAGAAAAGTTCCAAGAGGTGAAAGAGGCGATCCAAGCCCAGCAGTGCGCTGCCACTAAGGCCACTAACGGCTCAGGCGCCGCCACGCCAGTTGCCTCTGCCACCGCCAGTCCCCTACTGGCAGCTCGGGCAGCGGCTGACGAACCACCGGCGCTGTCTCCGGCACAGCCGAAGGTGGAAAACGATGAGATGATGGTACATCAACGAGCTCACTCCGTCTCCTCCTCTCTACAGGGTGGCTACGCGACCGTAGGCCGTTCGCCCCGCCCACCAGCTACGGCTACGAGTGCTACGGACGTAGACGACACACAGCTCCGATACGAAAATGATAGGCTGAAATTAGCTTTGGCACAGAGTTCTGCGAACGCGAAGAAATGGGAGGTAGAGCTGGCTACCCTCAAGAGTAATAACCTCCGTCTGACGGCCGCCCTGCAGGAGAGTACAGCCAATGTTGATGAATGGAAGAGACAACTACATCAGTATAGAGAGGAAGTCGCCAGGGCGCGACATTACGCTACTGGCAAAGGTGGTGACGCTAACGAGGTGGAACAGCTAAGGCAGCGTGTTGCGCAGCTTGAGGCTGAACTGGCGCAGAAGAACGAAGAATTGGCTCAGATCACCAAGTCTAAGAAGAGTGAACAGGACGCTGAAGCCAAATTGGCTCAGAGTCAACTGGAGTTAGCGCTGGCGGCTCAGGACGGGCAGCGGCAGGTACTGACGGCGCTAAACGATCAGCTCGCACGGCAGATAGAAGAACTGAGCAACGTCCATCGTGAGATCAGCACCGCTTTACAGACATGA

Protein sequence:

>DPOGS211731-PA
MDSQTEKTEQPIFTCKAHVFHIDPKTKRSWMSASSAAVSVSFFYDSSRNLYRIISVEGTKAVINSTITANMTFTKTSQKFGQWSDVRANTVYGLGFASEAELGKFIEKFQEVKEAIQAQQCAATKATNGSGAATPVASATASPLLAARAAADEPPALSPAQPKVENDEMMVHQRAHSVSSSLQGGYATVGRSPRPPATATSATDVDDTQLRYENDRLKLALAQSSANAKKWEVELATLKSNNLRLTAALQESTANVDEWKRQLHQYREEVARARHYATGKGGDANEVEQLRQRVAQLEAELAQKNEELAQITKSKKSEQDAEAKLAQSQLELALAAQDGQRQVLTALNDQLARQIEELSNVHREISTALQT-