Monarch geneset OGS2.0

DPOGS214908
TranscriptDPOGS214908-TA1332 bp
ProteinDPOGS214908-PA443 aa
Genomic positionDPSCF300153 - 314951-318850
RNAseq coverage485x (Rank: top 26%)
Annotation
HeliconiusHMEL0111140.082.81% 
BombyxBGIBMGA013889-TA8e-11680.33% 
DrosophilaGint3-PC1e-9947.05% 
EBI UniRef50UniRef50_E1ZVL01e-12654.00%UBX domain-containing protein 1 n=11 Tax=Endopterygota RepID=E1ZVL0_CAMFO
NCBI RefSeqXP_974668.15e-13256.71%PREDICTED: similar to UBX domain-containing protein 1 [Tribolium castaneum]
NCBI nr blastpgi|910858071e-13056.71%PREDICTED: similar to UBX domain-containing protein 1 [Tribolium castaneum]
NCBI nr blastxgi|910858075e-14258.11%PREDICTED: similar to UBX domain-containing protein 1 [Tribolium castaneum]
Group
Gene OntologyGO:00055151.8e-13protein binding
KEGG pathwaytca:6635351e-131 
 K14011 (UBXN6, UBXD1)maps-> Protein processing in endoplasmic reticulum
InterPro domain[171-261] IPR0189972.7e-22PUB domain
[176-245] IPR0065671.8e-13PUG domain
[340-414] IPR0010121.1e-08UBX
Orthology groupMCL14215 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS214908-TA
ATGGCTGATAAAATAAAGAAGTTTTTTCAAAAGAAAAAAATTGATGCTAAGTTCAAGTTAGCTGGACCCGGGCATAAATTAACTGAATCATCTCAATCAAGCCAGTCTTCTTTTTATAAAAAAGAAGTTCCTACAGTAAAAAGATCAGGGCTGTCAGAGGAAAGTAAAGTAGCAGCCGATGCTGCATTGGCAAGATTACAACAAAAAAGAGACAACCCTTCCTTTAACACATCTTTGGCTGCTATAAAGGCTCAAGTGAAGAAAGAATTGGAGAATGAAGTAGCATCTTCTTCAAAAGAACCAATTCAAGTGAAAGAAACTACTGAAGGAAATGTAGACATTCCTAAAAACTTGGCCGCGTCTGGTGTATACTTTAAATGTCCTATAATAAGCAATGATATTCTGTCTCGGGATGAATGGAAGAAAAATATTAAGACTTTTTTGTATGAACAATTAGAAGAAGAAAGAGGTCTTACTGCATGTCTTATAATACAGTCCTGTAATAGCAATAGAGAGAAGGTTGATATATGCGTGGAAACTCTATGCAAGTATTTAGAAAATATTGTGACACATCCCGATGATGAAAAGTATCAGAAGATTCGAATGAGCAACAGAGCATTTTGCGAAAGAGTCCAACCCATTGAAGGCTCGATGGAATTATTATTGGCAGCGGGTTTCATGCAAGAAAAACTTTTGAATAATGAAGGCAATGAAGAAGATTTTTTAGTTTTTAAAAAGGAAAATATTCCTTCAGTTGAAAGCTTGACTATGTTGATAGATGCTCTACGTACATCGGAACCGATTCCATTGGAACTTGACAGGAATCTCCAAGTATTGTTACCTTCTCAAGCAGCCAATAAAGTGCAATTACCGAGTTCATTCTACGCGCTTAGTCCAGAAGAAATTAAGAGAGAACAACAATTGAGAACCGAAGCCATGGAAAGAAGTCAAATGCTACGAACTAAAGCGATGAGGGAAAAGGACGAATTACGTGAAATGAGAAAATATAAATTTGCGATTATAAGAGTGCGTTTCCCTGACGGAATATTGTTGCAAGGCACATTTTCGGTGTACGAGCGTTATAGTGAAATACATGAATTCGTTCAAGAAAATTTGGAACACAACGGCCTTCCGTTTATACTGAACACTCCAACCGGCCACAAGATAATATATGAAGAAGATGCGAATAAAACTCTTATAGATCTAAGACTTGTACCAACAACAATGCTCACATTCGCCTGGCACAGTTCAGTCATAGACGAAATCAATAACAGCCCTAATAAGGACGTTTATTTGAAACCGGAAGTCATGGTCCTCGTACAAGAAATTTGA

Protein sequence:

>DPOGS214908-PA
MADKIKKFFQKKKIDAKFKLAGPGHKLTESSQSSQSSFYKKEVPTVKRSGLSEESKVAADAALARLQQKRDNPSFNTSLAAIKAQVKKELENEVASSSKEPIQVKETTEGNVDIPKNLAASGVYFKCPIISNDILSRDEWKKNIKTFLYEQLEEERGLTACLIIQSCNSNREKVDICVETLCKYLENIVTHPDDEKYQKIRMSNRAFCERVQPIEGSMELLLAAGFMQEKLLNNEGNEEDFLVFKKENIPSVESLTMLIDALRTSEPIPLELDRNLQVLLPSQAANKVQLPSSFYALSPEEIKREQQLRTEAMERSQMLRTKAMREKDELREMRKYKFAIIRVRFPDGILLQGTFSVYERYSEIHEFVQENLEHNGLPFILNTPTGHKIIYEEDANKTLIDLRLVPTTMLTFAWHSSVIDEINNSPNKDVYLKPEVMVLVQEI-