Monarch geneset OGS2.0

DPOGS216077
TranscriptDPOGS216077-TA1311 bp
ProteinDPOGS216077-PA436 aa
Genomic positionDPSCF300067 + 499202-502756
RNAseq coverage1021x (Rank: top 12%)
Annotation
HeliconiusHMEL0089440.081.88% 
BombyxBGIBMGA008874-TA0.080.50% 
DrosophilaFaf-PA4e-12851.08% 
EBI UniRef50UniRef50_Q17DA07e-12848.64%Fas-associated protein n=14 Tax=Neoptera RepID=Q17DA0_AEDAE
NCBI RefSeqXP_001961665.17e-12951.09%GF14818 [Drosophila ananassae]
NCBI nr blastpgi|1947588381e-12751.09%GF14818 [Drosophila ananassae]
NCBI nr blastxgi|1700625423e-12448.97%UBX domain-containing protein 8 [Culex quinquefasciatus]
Group
Gene OntologyGO:00055151.4e-14protein binding
KEGG pathwaypgu:PGUG_054771e-13 
 K14013 (UBX2, SEL1)maps-> Protein processing in endoplasmic reticulum
InterPro domain[139-263] IPR0065779.1e-42UAS
[358-431] IPR0010121.4e-14UBX
Orthology groupMCL13839 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS216077-TA
ATGGATTTAGAAGACAATGCTTTGGGTTTGACCCAAGATCAAACTGAAAAAATGTTGCAATTCCAAGACCTAACAGGCATCGAAGATATATCAATATGCAGGGACGTATTACAAAGGCATCAGTGGGATTTGGAGGTTGCAATACAGGAACAGCTCAATATTAGGGAAGGTCGGCCATCAGTATTTGCTACAGAAGCAAGAGCACCTACGGTAGTACATGATCACATTGCTCAGCAAGTTTTTACAGATGATGGATCAGATGGACCTGGAGGAGTAAGGGGACTTTTCCGTTATGTTGTCAACTTAGTTGTATCAATGTGTTACAGCACTATAACTTCTGTATTGAACTTGCTGCTAAGTTTTGTAAGAAACGATGATAGAAGATTGGTTACCGATCAACTTGGTGATGTTATGGGTTTTATCAACAATTACACATCAAGATTTAGCCCGCATCCCGTTTTCTATCAGGGGACTTATGCACAGGCTCTCAATGATGCAAAGAATGAGCTGAGATTTTTAATTGTCTATCTGCATTCAGAATCTGCAACAGAGACACAAAACTTTTGCAGAACAACACTGGCTGACCCTGATGTAATACAGTACATAAACACACATGCTCTGTTTTGGGGTTGTTCCATTGACACTTCGGAGGGTTGGCGAGTAGCTCAGTCTGTGGGGGGTCGTAGATATCCTTTGATGTGCGTCGTATGTGTGAGAGACCATCGTATGACGGTGGTAGCTAGGAGTGAAGGTGCTTGCGCGCCTCAACAGCTGTTACAACGACTACAAAGAGTCGTTACTGAGAATGAGCCACATCTCGCCGCTGCTAGGGCTGACAGAGTCGAGCGCGAGGTGACGGCGCGTCTGCGTGCGGCTCAGGACGAGGCTTACGCGGAGTCGCTGGCGGCTGATCAGGAGAAGGAGAGGAAGAAGGAGAGAGAGCGAGAGGCCCGGGACCAGCTGGAGAGGGATACTCTACACAGACAGATGATGGAGGAACAGCATCGACAACAGGTGATCGAAGCTCGTGCGGCTATGGCGGCCAGTCTGCCCGAGGAGCCGGCCACGGGATCGACGGCCGTTGCGTTACTCATACGACTGCCGTGCGGGGAGAGACTCACGAGGAGGTTCTACCTCGTGGATACTACGCAGGATCTGTACAATTTCGTCTTCAGTCATCCGCAGTCTCCTGAGGAGTTTGAGATCACCACTAATTTCCCTAAACGTGTAATCGCCAGAGGTCCGTCGACCTTAACGGATGTGGGCCTCAAGGATCGGGACGTGCTGTTCGTTAATGATACAAACGCATAA

Protein sequence:

>DPOGS216077-PA
MDLEDNALGLTQDQTEKMLQFQDLTGIEDISICRDVLQRHQWDLEVAIQEQLNIREGRPSVFATEARAPTVVHDHIAQQVFTDDGSDGPGGVRGLFRYVVNLVVSMCYSTITSVLNLLLSFVRNDDRRLVTDQLGDVMGFINNYTSRFSPHPVFYQGTYAQALNDAKNELRFLIVYLHSESATETQNFCRTTLADPDVIQYINTHALFWGCSIDTSEGWRVAQSVGGRRYPLMCVVCVRDHRMTVVARSEGACAPQQLLQRLQRVVTENEPHLAAARADRVEREVTARLRAAQDEAYAESLAADQEKERKKEREREARDQLERDTLHRQMMEEQHRQQVIEARAAMAASLPEEPATGSTAVALLIRLPCGERLTRRFYLVDTTQDLYNFVFSHPQSPEEFEITTNFPKRVIARGPSTLTDVGLKDRDVLFVNDTNA-