Monarch geneset OGS2.0

DPOGS202527
Transcript: DPOGS202527-TA (1539 bp)
Protein: DPOGS202527-PA (512 aa)
Genomic position: DPSCF300131 + 352545-357810
RNAseq coverage: 1642x (Rank: top 8%)
Annotation
Heliconius       HMEL007437         0.0      90.61%
Bombyx           BGIBMGA001551-TA   0.0      89.29%
Drosophila       Ubqn-PA            1e-172   58.46%
EBI UniRef50     UniRef50_Q9VWD9    2e-170   58.46%   LD38919p n=18 Tax=Endopterygota RepID=Q9VWD9_DROME
NCBI RefSeq      XP_001652587.1     0.0      66.47%   ubiquilin 1,2 [Aedes aegypti]
NCBI nr blastp   gi|157115392       0.0      66.47%   ubiquilin 1,2 [Aedes aegypti]
NCBI nr blastx   gi|157115392       0.0      66.60%   ubiquilin 1,2 [Aedes aegypti]
Group
Gene Ontology     GO:0005515   3.3e-15   protein binding
KEGG pathway      aag:AaeL_AAEL007160   0.0
                  K04523 (UBQLN, DSK2) maps-> Protein processing in endoplasmic reticulum
InterPro domain   [6-513]     IPR015496   6.3e-191   Ubiquilin
                  [17-82]     IPR000626   3.3e-15    Ubiquitin
                  [460-509]   IPR009060   4.3e-15    UBA-like
                  [169-208]   IPR006636   1.9e-07    Heat shock chaperonin-binding
                  [467-503]   IPR000449   9.1e-07    Ubiquitin-associated/translation elongation factor EF1B, N-terminal
                  [467-505]   IPR015940   2.5e-06    Ubiquitin-associated/translation elongation factor EF1B, N-terminal, eukaryote
Orthology group   MCL12023   Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS202527-TA
ATGGCAGAAGGCCAGGAGGAACCTAAAAAGATTACAATTACTGTAAAAACACCAAAAGAAAAGCAGCAAGTTGAAATCGAAGAAGATGCAGATATCAAAAAACTCAAAGAAGTGTTGTCCCCTAAATTTAACGCGGAGCCCGAACAGCTATGTTTAATTTTTGCCGGAAAAATTATGAACGATTCAGATACTATGAAGCAACATAACATCAAAGATGGGTTGACAGTTCATCTTGTTATCAAGACTCCTCCAAGACCTGAACCGGAAGGTGGAACACGGCGCCCTCCAGCTGATATTGGTGCTACACCTTTTGGACTGAACTCTCTTGGGGGTCTAGCAGGCTTAGAAAGCCTTGGTCTAGGCCAAAGCACTTTTATGGACCTGCAAGCTCGTATGCAACAAGAGCTTTTGTCGAACCCTGATATGTTACGACAAGTGCTGGATAACCCACTCGTTCAGCAAATGATGAACGATCCTGAGAATATGCGAACCCTTATTACATCCAACCCGCAGATGCAAGATTTGATGGCTAGGAACCCTGAAATTAGTCATATGTTGAACAACCCTGAACTGTTACGACAAACAATGGAATTGGCACGCAATCCTGCCATGCTTCAAGAGTTGATGAGGTCCCATGACCGAGCTTTGTCCAACTTGGAGAGTATACCTGGTGGTTACAATGCTTTGCAGCGAATGTATCGAGACATCCAAGAACCAATGTTGAATGTAGCCAGTAGCATGGCTGGAAATCCATTCTCTGGACTAGTAGACAATTCAGATGGCACCAATCCCCAACAGGGGGCAGAGAACCGTCAGCCCCTTCCAAACCCTTGGCAGCGTGGAGGTTCTAATGCATCTAGCACACCAAACACAGGCCCAGGCCTTATCAATACACCTGGCATGCAGTCATTGCTACAACAGATGTCTGAAAATCCTCGTCTTGTACAATCAATGCTATCAGCACCATACACTAATAGTATGCTACAAGCTCTCGCTGCCGACCCGGAGATGGCATCTCAACTTATTAACCAGAATCCCATGTTTGCCAATAATCCACAACTGCAAGAACAGATTCGTACTATGATGCCACAAATGCTAGCCCAGCTGCAGAATCCAGAAATGCAACAGATGATGTCTAATCCACAGGCGCTGAATGCCCTACTTCAGATCCAGCAGGGTATGGAACAATTGCGAGCGGCGGCACCAAGTCTGGTCAATAATATGGGCTTCGGAGCAGCCGCTGCCACTGCCGCCCCACCCCCACCTCCCACTACTAACACACCGCCAGCACAAGCGAGACAACAACAGAACTCTGAGCTGTTCACACAGTTCATGCAAAGAATGGTATCGGCGATGGCCAACAACCAGACCAACACTCAGCAACCCCCGGAACAACGCTACTCACAACAGCTAGAGCAACTTGCAGCCATGGGTTTCCTCAACAGGGAGGCTAATTTACAAGCACTGATCGCAACATTTGGTGACGTGAACGCGGCAGTTGAAAGGCTACTGGCTCTAGGTCAACTGTCCATGAGCTAA

Protein sequence:

>DPOGS202527-PA
MAEGQEEPKKITITVKTPKEKQQVEIEEDADIKKLKEVLSPKFNAEPEQLCLIFAGKIMNDSDTMKQHNIKDGLTVHLVIKTPPRPEPEGGTRRPPADIGATPFGLNSLGGLAGLESLGLGQSTFMDLQARMQQELLSNPDMLRQVLDNPLVQQMMNDPENMRTLITSNPQMQDLMARNPEISHMLNNPELLRQTMELARNPAMLQELMRSHDRALSNLESIPGGYNALQRMYRDIQEPMLNVASSMAGNPFSGLVDNSDGTNPQQGAENRQPLPNPWQRGGSNASSTPNTGPGLINTPGMQSLLQQMSENPRLVQSMLSAPYTNSMLQALAADPEMASQLINQNPMFANNPQLQEQIRTMMPQMLAQLQNPEMQQMMSNPQALNALLQIQQGMEQLRAAAPSLVNNMGFGAAAATAAPPPPPTTNTPPAQARQQQNSELFTQFMQRMVSAMANNQTNTQQPPEQRYSQQLEQLAAMGFLNREANLQALIATFGDVNAAVERLLALGQLSMS-
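
The lengths reported in this record (1539 bp transcript, 512 aa protein) can be cross-checked with a few lines of Python. This is a minimal sketch, not part of the database: the function names are hypothetical, and it assumes the transcript is a complete coding sequence ending in a stop codon and that the trailing "-" in the protein sequence marks that stop.

```python
def expected_protein_length(cds_bp: int, has_stop_codon: bool = True) -> int:
    """Return the number of amino acids encoded by a CDS of cds_bp nucleotides.

    Assumption: the CDS starts at the first codon position and, when
    has_stop_codon is True, its last codon is a stop codon.
    """
    if cds_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = cds_bp // 3
    return codons - 1 if has_stop_codon else codons


def strip_stop_marker(protein: str) -> str:
    """Drop a trailing stop marker ('-' or '*') from a protein string."""
    return protein.rstrip("-*")


# Lengths taken from the record: 1539 bp transcript, 512 aa protein.
print(expected_protein_length(1539) == 512)
```

To compare against the actual sequence, pass the protein string through `strip_stop_marker` before taking its length, so the stop marker is not counted as a residue.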