Monarch geneset OGS2.0

DPOGS214852
Transcript         DPOGS214852-TA      1188 bp
Protein            DPOGS214852-PA      395 aa
Genomic position   DPSCF300091 - 118240-122992
RNAseq coverage    5x (Rank: top 88%)
Annotation
Heliconius         HMEL015016          4e-127    89.17%
Bombyx             BGIBMGA010018-TA    2e-88     89.51%
Drosophila         CG6454-PC           1e-91     68.42%
EBI UniRef50       UniRef50_G6CZK9     2e-127    99.53%    Putative uncharacterized protein n=4 Tax=Coelomata RepID=G6CZK9_DANPL
NCBI RefSeq        XP_395128.3         2e-94     71.37%    PREDICTED: similar to CG6454-PA, isoform A, partial [Apis mellifera]
NCBI nr blastp     gi|328787309        8e-95     69.58%    PREDICTED: uncharacterized protein KIAA0528-like [Apis mellifera]
NCBI nr blastx     gi|328787309        5e-90     69.87%    PREDICTED: uncharacterized protein KIAA0528-like [Apis mellifera]
Group
Gene Ontology      GO:0005515          2.3e-23   protein binding
KEGG pathway       cfa:489819          3e-08
                   K03008 (RPB11) maps-> Huntington's disease
                                         Purine metabolism
                                         Pyrimidine metabolism
                                         RNA polymerase
InterPro domain    [1-134] IPR008973   2.3e-23   C2 calcium/lipid-binding domain, CaLB
                   [6-90] IPR000008    3.2e-17   C2 calcium-dependent membrane targeting
Orthology group 

Nucleotide sequence:

>DPOGS214852-TA
ATGCCGGGTAAAATAAAAGTGAAGGTGTTAGCGGGTCGCAATCTGCCAGTGATGGACAGGGCTAGTGACACCACTGATGCCTTTGTGGAAATTAAGTTCGGAGGAGTCACCCACAAAACGGATGTATGTCGGAAATCGCTGAATCCTCATTGGAACAGCACCGAGTGGTATAGGTTTGAGGTTGACGAATCAGAGCTCCAAGATGAACCGCTTCAGCTGCGACTGATGGATCACGATACATATTCAGCTAACGACGCCATCGGGAAGGTGGTCATCAGTCTTGCTCCTCTCCTAGCTCGGGAGGCGAATAACGCCAACGGTACCACCGGCCCACCTGGTGGCGCTGTCATGTCAGGGTGGATACCAGTCTTCGACACGATGCACGGCACTCGCGGCGAACTGAACATCATCGTCAAAGTTGAACTTTTCTCTGACTTCAATAAATACAAAACATCAAGCTGTGGGGTTCAGTTCTTTCATTGCCCAATGATCCCTCCTGGATATAGAGCCACGGCCATCCACGGCTTCGTCGAAGAGCTCATTGTGAATGATGATCCAGAGTATCAGTGGATCGACAAGATCAGGACACCCCGGGCTTCTAACGAGGCGAGACAGGTCGCCTTCATTAAACTCAGCAATCAAGCGGATTCGTTAGATTCAACAGATTCTCTAGACATGGCGGTGAAGCAAATGAAGCCAGATCAAGACCTCAACGACGCGGTCATAGTTCAAGATAGTGACACAAGCAACACCAGTATTAATACAATGGTGAATAGAATAGAAAATTCAGAATCACCGTACTTAAGTTTAAAACAAAGCAGCCTGGCGCTGACCGCCTTACAAAGACATCCCACACATCCGCTACCTGATAGGAACCTTAAGCCATCAGAACTAGCTCTGTTTAATTCAGCTCGGAACGCGATCACCAAGAAACCAGATATACAGAGGACAAGCATCTTCCACCAGAAGGAACCGATTAAGATACAATTATCTTCAGAGAATGACCCAAATATGTCATTACTAAACATATCCTATGACGGCCTGGGGGGTCATTCTAAACATGACACTTTCCCCTCACCGAGACAATCACTGAAAAGTTGCATTCCAAAACTATCATCCAAACACAAATTAAAACGACCAAATCCGATTGGAAGCAGAGATATAAGCAAAATGCTGAAGAAAACTTAA

Protein sequence:

>DPOGS214852-PA
MPGKIKVKVLAGRNLPVMDRASDTTDAFVEIKFGGVTHKTDVCRKSLNPHWNSTEWYRFEVDESELQDEPLQLRLMDHDTYSANDAIGKVVISLAPLLAREANNANGTTGPPGGAVMSGWIPVFDTMHGTRGELNIIVKVELFSDFNKYKTSSCGVQFFHCPMIPPGYRATAIHGFVEELIVNDDPEYQWIDKIRTPRASNEARQVAFIKLSNQADSLDSTDSLDMAVKQMKPDQDLNDAVIVQDSDTSNTSINTMVNRIENSESPYLSLKQSSLALTALQRHPTHPLPDRNLKPSELALFNSARNAITKKPDIQRTSIFHQKEPIKIQLSSENDPNMSLLNISYDGLGGHSKHDTFPSPRQSLKSCIPKLSSKHKLKRPNPIGSRDISKMLKKT-
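The transcript and protein records above can be cross-checked by translating the nucleotide sequence. The following is a minimal sketch, assuming the standard genetic code and that the trailing "-" in the protein record marks the stop codon. The function name `translate` and its `stop_symbol` parameter are illustrative and not part of the database. Only the first 30 and last 18 nucleotides of DPOGS214852-TA are used here to keep the example short.

```python
from itertools import product

# Standard genetic code, with codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(nt, stop_symbol="-"):
    """Translate a nucleotide string in frame 0.

    Stop codons are written as `stop_symbol` (assumed here to match the
    "-" at the end of the protein record); codons containing characters
    outside ACGT become "X". A trailing partial codon is ignored.
    """
    nt = nt.upper()
    protein = []
    for i in range(0, len(nt) - len(nt) % 3, 3):
        aa = CODON_TABLE.get(nt[i:i + 3], "X")
        protein.append(stop_symbol if aa == "*" else aa)
    return "".join(protein)


# First 30 nt and last 18 nt of DPOGS214852-TA, copied from the record.
first_30 = "ATGCCGGGTAAAATAAAAGTGAAGGTGTTA"
last_18 = "ATGCTGAAGAAAACTTAA"

print(translate(first_30))  # start of DPOGS214852-PA
print(translate(last_18))   # end of DPOGS214852-PA, including the stop
```

Passing the full transcript to the same function should reproduce the whole protein record if the two sequences are consistent.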