Monarch geneset OGS2.0

DPOGS215621
TranscriptDPOGS215621-TA1239 bp
ProteinDPOGS215621-PA412 aa
Genomic positionDPSCF300041 - 2072157-2095178
RNAseq coverage2042x (Rank: top 6%)
Annotation
HeliconiusHMEL0059301e-8677.78% 
BombyxBGIBMGA003682-TA4e-8473.53% 
Drosophilayki-PF5e-4940.06% 
EBI UniRef50UniRef50_B2CMB60.079.63%YAP65-like protein n=2 Tax=Endopterygota RepID=B2CMB6_BOMMO
NCBI RefSeqNP_001116819.10.079.63%YAP65-like protein [Bombyx mori]
NCBI nr blastpgi|1825102160.079.63%YAP65-like protein [Bombyx mori]
NCBI nr blastxgi|1825102160.081.01%YAP65-like protein [Bombyx mori]
Group
Gene OntologyGO:00055155.1e-17protein binding
KEGG pathwaycim:CIMG_071202e-20 
 K10591 (NEDD4, RSP5)maps-> Ubiquitin mediated proteolysis
    Endocytosis
InterPro domain[134-170] IPR0012025.1e-17WW/Rsp5/WWP
Orthology groupMCL13429 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS215621-TA
ATGGCTCTCAATTCGGACGGTGAACAGAAATCGAACCTGGTCCTCCGCGTAGACCAGGATTCTGACTCGGTTTTGCAGTCACTATTCGACACAGTGCTTAAGCCTGACTCAAAACGACCGCTACAGGTGCCTCTGCGTATGCGACAGCTTCCAAAGTCGTTCTTTAACCCGCCGTCGACCGGTTCCAAGTCGCCATCTGTGTCTCATTCGCGTGAAAATTCGGCTGATTCTGCGTTTGGGTCTTCATCTGCAACTGGCACAGCTCCCGTGTCCCATTCTCGTGCACACTCGTCTCCAGCTAGTTTACAACAGACATATGCTGCAGGTCAACAGAATCAGCAACCACCTTTACCGCATCAACACGCAAAACAGAGATCCTACGATGTTGCTTCACATATTCCAGATGAACTGGGACCTTTACCAGCTGGATGGGAACAAGCTCGCACTCCAGAGGGACAGATATATTATCTCAATCACATAACCAAGACGACCACATGGGAGGACCCTCGGAAGACATTGGCGGCGCAAACAGTGTCCGCAGGCGTGCAACATCAGAACGAGATACTCACCCCACCACAGACTATAGCAGCCACCCCGACTGCAGCGAAGAGTACAAGCACTAACACGACGACAGATCCCCTCGGACCTCTGCCGGACGGCTGGGAACAGGCGACTACACCAGAGGGAGAGATATACTTCATCAATCACGCAGCCCGAACTACATCCTGGTTCGATCCCAGAATACCACAACACTTGCAGCGCACTCCCACGTCAGGCGGCGCGGCCGGCGGCGGCTGGGCGAACGCGGCGTGCCAACAGAAACTGAGGTTGCAGTCGCTGCAGCTGGAAAGAGAGAGACTCAAACAGCGCCAGCAGGAGATACGACTCCAGCAAGAGCTGATGGCGCGGCAGTCGTCGTCTATAGTGTCGTCGTTGGCCACCACGGAGGTGTCTCTGGACCCGTTCCTGTCGGGCATCAGCGAGCACACGCGCCAGGAATCAGCTGACAGCGGGCTGGGCATGGTACAGTACTCAGTGCCTCACACGCCGGAGGACTTCCTCGCCAGTATGGACGACCGTATGGACGTGGCCAGCGAACCCGGCGCGCCCATGGACGCTGACATCACACTCGGAGACACTGATCTGCTCGGAGACTTCACAAATGATATTCTAGACGACGTACAGTCTCTGATAAATTCGACGCCTAACAAGCCAGATAACGTTTTAACCTGGCTGTAG

Protein sequence:

>DPOGS215621-PA
MALNSDGEQKSNLVLRVDQDSDSVLQSLFDTVLKPDSKRPLQVPLRMRQLPKSFFNPPSTGSKSPSVSHSRENSADSAFGSSSATGTAPVSHSRAHSSPASLQQTYAAGQQNQQPPLPHQHAKQRSYDVASHIPDELGPLPAGWEQARTPEGQIYYLNHITKTTTWEDPRKTLAAQTVSAGVQHQNEILTPPQTIAATPTAAKSTSTNTTTDPLGPLPDGWEQATTPEGEIYFINHAARTTSWFDPRIPQHLQRTPTSGGAAGGGWANAACQQKLRLQSLQLERERLKQRQQEIRLQQELMARQSSSIVSSLATTEVSLDPFLSGISEHTRQESADSGLGMVQYSVPHTPEDFLASMDDRMDVASEPGAPMDADITLGDTDLLGDFTNDILDDVQSLINSTPNKPDNVLTWL-