Monarch geneset OGS2.0

DPOGS207294
Transcript: DPOGS207294-TA  1131 bp
Protein: DPOGS207294-PA  376 aa
Genomic position: DPSCF300008 + 665891-667315
RNAseq coverage: 519x (Rank: top 24%)
Annotation
Heliconius: HMEL016334  4e-162  76.71%
Bombyx: BGIBMGA012025-TA  7e-148  68.63%
Drosophila: CG12608-PA  9e-65  39.18%
EBI UniRef50: UniRef50_E2BED9  4e-64  43.85%  p21-activated protein kinase-interacting protein 1-like n=7 Tax=Formicidae RepID=E2BED9_HARSA
NCBI RefSeq: XP_001963842.1  7e-72  41.10%  GF21235 [Drosophila ananassae]
NCBI nr blastp: gi|194763443  1e-70  41.10%  GF21235 [Drosophila ananassae]
NCBI nr blastx: gi|195432310  2e-71  39.22%  GK19853 [Drosophila willistoni]
Group
Gene Ontology: GO:0005515  1.3e-38  protein binding
KEGG pathway: (none listed)
InterPro domain:
[4-299] IPR015943  1.3e-38  WD40/YVTN repeat-like-containing domain
[35-296] IPR011046  2e-38  WD40 repeat-like-containing domain
[68-102] IPR019781  2.1e-06  WD40 repeat, subgroup
Orthology group: MCL11833  Single-copy universal gene

Nucleotide sequence:

>DPOGS207294-TA
ATGGATGAAATAGTCGTGGGCACCTATGAAGGCTTTTTACTAGGATATTCACTGCAACCTGAGGACGATAGAACTATTCTGAAACAAACTTTCGCAACCTCTTCCCACACTGCAGCCGTAAGATGTTTATCTATAGCTGGAAAATTTTTAGCATCGGGTGGAACAGATGACAAGGTTGTAATTATAGACCTGAAAACCCGAAAAGAACACACTGTTCTTATGAATCATGATGGAACCGTTAACACCGTTGCATTTACAAATGGGGGGACACATCTATTAACTGGAAGTGATGATGGATCTGTTATTGTTACTAGAACTGGCAATTGGCAGATAGAGAAAATATGGAAAAAGGCTCATGGAGGTCAACCAGTTACAACTATAGTGGTTCATCCTTCTGACAAGTTAGCATTAAGTATAGGTGGTGATAAGACCCTGAGAACATGGAATTTAATTAAAGGTCGACCAGCATTTACTATTAATCTTGGAAGTAAAGGAGTTGGTTTGCCTACAGAGATAAAGTTTTCACCGAGTGGTGATAGATTTTCCCTTATATCCTTACAAAATGTTGATGTGTGGACTATAAGTAAGGCAGGATTGGAAAAACGTTTAACATGCAACTCCAAACCATCAACAACTCAATGGACCAATGACGATGAACTGTTTGTTGGATTGGAAAATGGGAATATTATTAAATTCACCGTATCTGAAACCAAAGCACAGACTTATCAGGCTTACAAGCAAAGAGTAAAATGTATGCATTACGAAAATAGCAACTTATACTCTGCATCCAGTAACGGAGTGTTAAAAGCTTGGCATGTTGATGATGATAATTTACAAGAGATATGCTCTACAAATATATCATGTAGAGTCACTTGTATTGCCTTAAACAGACAACGTCATTTAATAAAGAAGGAGGAGAATTGTGAAGATGACGGTAAGGCTTCAGGGCTGTCAGATAACGAAGATAAGAGTAATGAAAGTGATGACTCTGAAATTGTAGAAAGGCCACCAGCTAAGAGGCAGCCAGGAGCATTTGTGAAAATCAGTTATGACGAGAACAACATAGAATCACCTCCAGCCAAGAAAACTAAGAGAAAGAAGAAAAACAAAAAGAAAAATAAAACTGAATAA

Protein sequence:

>DPOGS207294-PA
MDEIVVGTYEGFLLGYSLQPEDDRTILKQTFATSSHTAAVRCLSIAGKFLASGGTDDKVVIIDLKTRKEHTVLMNHDGTVNTVAFTNGGTHLLTGSDDGSVIVTRTGNWQIEKIWKKAHGGQPVTTIVVHPSDKLALSIGGDKTLRTWNLIKGRPAFTINLGSKGVGLPTEIKFSPSGDRFSLISLQNVDVWTISKAGLEKRLTCNSKPSTTQWTNDDELFVGLENGNIIKFTVSETKAQTYQAYKQRVKCMHYENSNLYSASSNGVLKAWHVDDDNLQEICSTNISCRVTCIALNRQRHLIKKEENCEDDGKASGLSDNEDKSNESDDSEIVERPPAKRQPGAFVKISYDENNIESPPAKKTKRKKKNKKKNKTE-
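
The lengths reported in this record can be cross-checked with a little arithmetic. The sketch below is illustrative only and not part of the database: it assumes the transcript is pure coding sequence ending in a stop codon (the nucleotide sequence ends in TAA and the protein ends in "-"), and that the genomic coordinates are 1-based and inclusive. The function name `cds_length` is hypothetical.

```python
# Values copied from the record above.
transcript_bp = 1131
protein_aa = 376
genomic_start, genomic_end = 665891, 667315


def cds_length(aa_count, include_stop=True):
    """Length in bp of a coding sequence for aa_count residues.

    Each residue is one codon (3 bp); the stop codon adds one more codon.
    """
    return 3 * (aa_count + (1 if include_stop else 0))


# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
genomic_span = genomic_end - genomic_start + 1

print(cds_length(protein_aa))        # 1131, matches the transcript length
print(genomic_span)                  # 1425
print(genomic_span - transcript_bp)  # 294 bp of the locus not in the transcript
```

Under these assumptions the 376 aa protein plus a stop codon accounts for exactly 1131 bp, and the genomic span is 294 bp longer than the transcript, which would be consistent with intronic sequence in the locus.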