Monarch geneset OGS2.0

DPOGS203393
TranscriptDPOGS203393-TA1323 bp
ProteinDPOGS203393-PA440 aa
Genomic positionDPSCF300003 + 873479-876941
RNAseq coverage602x (Rank: top 21%)
Annotation
HeliconiusHMEL0166241e-7481.21% 
BombyxBGIBMGA012292-TA1e-14873.95% 
DrosophilaUch-L3-PA9e-11059.87% 
EBI UniRef50UniRef50_Q9XZ611e-10759.87%26S proteasome regulatory complex subunit p37A n=35 Tax=Eumetazoa RepID=Q9XZ61_DROME
NCBI RefSeqXP_002431967.16e-12062.23%ubiquitin carboxyl-terminal hydrolase isozyme L5, putative [Pediculus humanus corporis]
NCBI nr blastpgi|1839792681e-13771.96%similar to CG3431-PA [Papilio xuthus]
NCBI nr blastxgi|1839792686e-13471.96%similar to CG3431-PA [Papilio xuthus]
Group
Gene OntologyGO:00065112.4e-157ubiquitin-dependent protein catabolic process
GO:00056222.4e-157intracellular
GO:00042212.4e-157ubiquitin thiolesterase activity
KEGG pathway 
InterPro domain[1-342] IPR0015782.4e-157Peptidase C12, ubiquitin carboxyl-terminal hydrolase 1
Orthology groupMCL12912 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS203393-TA
ATGGAGTCTGCGGGTGACTGGTGCCTGATCGAAAGTGATCCTGGCGTCTTTACGCAACTGATTAAGAAATTTGGTGTTAAGGGTGTCCAAGTGGAAGAGATGTGGACTTTTGATGACAGCATTTTTGATAACCTAAGACCCGTTCATGGATTGATATTCTTGTTCAAATATTTGCAACATGATGAACCTCCTCATCCCGTAGTCAAGGACAACAGGCTGGAGAAAATATATTTTGCTAAACAGGTGATAAACAATGCATGTGCGACACAAGCTGTTGTAAGTCTCCTTTTGAATTGCAACCATCCAGATGTCATATTAGGGCCGGAGTTGACAAAGTTAAAGGAATTCAGTATGTCATTTGATCCAAGGATGCGCGGTCTCACACTAAGCAACTCTCAGACTATAAGAAGTGCACATAACTCTATGTCCCAACAAGCTCTGTTTGAATTTGATCCAAAAGTCCCCACAAAGGATGAGGATGCTTACCATTTCATTGGATATATGCCAATTGATGGACGGCTGTATGAACTAGATGGACTCCGCGAGGGACCCATCGATCATGGACCGATTGCTCCGGAACAAGATTGGTTGGATGTCGTACGTCCTATTATTGTGTCTCGCATTAATGTATACACGGAAGGCGAAATACATTTTAACCTAATGGCTCTTGTATCAGATAGAAAAATGATATACGAGAGACAAATACAGGCGCTCATGAGTGAGACCAGGATGCTTGGCATGGAAACAGATGACGTGGACGTTGAAATAAGAAGATTGCGTATGCTAATAGAATACGAAGACGCCAAAATGTTGAGATACAACCAGGAGATGTTGAGGAGACGGCATAATTACTTGCCGTTCATCATCACACTGCTCAAGATATTGGCGGAGGAGAAGAAGTTGTCGCCTCTCTTGGAAAAGGCGAAGGAGCGCGCGCTCAAGAAGGGGCCTAAGAAAGTGAAATCCAGCCCCTCGCGGGTGCTGCCGCCCTCCGAGCACATGCAATATTCTTTAGCAGATTTAGATTCTATATTTAGAGAGCCTCAAGAGCCTCCGAGCCAGTTGGAGCCGCAGGACGTTTTGCAGCAGAGCTTATTAGAACCGCCAGCTGACATGGACATGTTGGCCACCGACGACTTCCTCAAAGACGTCGTCCTGGACACCGGCTTCGAGCACGACCTCATAGACGTTCATGATATATTTGACGAAAATATGATACCAACCGAAGAATTCCCACACGACGACATCTTAGACCCCGATGAACTGCTCATCAGAGACTACATGCGCAATCCCGGAAACGAAGAAGAACGCGATGCGGAATAG

Protein sequence:

>DPOGS203393-PA
MESAGDWCLIESDPGVFTQLIKKFGVKGVQVEEMWTFDDSIFDNLRPVHGLIFLFKYLQHDEPPHPVVKDNRLEKIYFAKQVINNACATQAVVSLLLNCNHPDVILGPELTKLKEFSMSFDPRMRGLTLSNSQTIRSAHNSMSQQALFEFDPKVPTKDEDAYHFIGYMPIDGRLYELDGLREGPIDHGPIAPEQDWLDVVRPIIVSRINVYTEGEIHFNLMALVSDRKMIYERQIQALMSETRMLGMETDDVDVEIRRLRMLIEYEDAKMLRYNQEMLRRRHNYLPFIITLLKILAEEKKLSPLLEKAKERALKKGPKKVKSSPSRVLPPSEHMQYSLADLDSIFREPQEPPSQLEPQDVLQQSLLEPPADMDMLATDDFLKDVVLDTGFEHDLIDVHDIFDENMIPTEEFPHDDILDPDELLIRDYMRNPGNEEERDAE-