Monarch geneset OGS2.0

DPOGS202056
TranscriptDPOGS202056-TA1512 bp
ProteinDPOGS202056-PA503 aa
Genomic positionDPSCF300053 + 852171-854884
RNAseq coverage269x (Rank: top 40%)
Annotation
HeliconiusHMEL0167690.065.93% 
BombyxBGIBMGA012532-TA6e-15959.95% 
DrosophilaCG12096-PA5e-4428.83% 
EBI UniRef50UniRef50_E2B9T44e-6632.13%26S proteasome non-ATPase regulatory subunit 5 n=6 Tax=Formicidae RepID=E2B9T4_HARSA
NCBI RefSeqXP_001605001.11e-7932.40%PREDICTED: hypothetical protein [Nasonia vitripennis]
NCBI nr blastpgi|1565457263e-7832.40%PREDICTED: 26S proteasome non-ATPase regulatory subunit 5-like [Nasonia vitripennis]
NCBI nr blastxgi|1565457268e-7832.46%PREDICTED: 26S proteasome non-ATPase regulatory subunit 5-like [Nasonia vitripennis]
Group
Gene OntologyGO:00441835.9e-97protein binding involved in protein folding
GO:00054883.3e-18binding
KEGG pathway 
InterPro domain[16-502] IPR0195385.9e-9726S proteasome non-ATPase regulatory subunit 5
[3-470] IPR0160243.3e-18Armadillo-type fold
[285-433] IPR0119891e-10Armadillo-like helical
Orthology groupMCL11927 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS202056-TA
ATGTCGAATCGAGACCTGGACGAGTTAAGAAATTGCTTTAATAGACTAAAAATAAAGGAAAACGTTCCGACGGCACTAAATGAGATAAAAAGTATTTTAGCTTACAAACCAGCTGCCGAAGCTGCCCCAACCATCAGGAACATTGGGATTTCCAAAATCCTATATTGCATCAACACCAGTACCAAAAGTGAAGCTGAACTTGCTTGTGATGTCTTAAAGATATGTTTTGATAAATTTGAACCTGGAGAAGCTATTAGGTGTTATATCAGTCACTTCATGTATTTATTGCGGCATGAAAGGGACTGTGTCCGAAGATTGGCGGTGGATGAGGTCTATAAGGCTATAACATCAAATGTCAATGTGCTTCCGTTGCCACAATACATAGATGTATATGTGGCAGTCGCTCAAATGATCTGTGATGTTGATATTGGCATAGCCAATAAAGCAGTTTTGATAACGAGCAATTTGCCTACGGAAGCTTATCCAAAAGTTTTGGAAGAAATGAAAATTGCTCTAGATTGCAACAATAGTGCCAAATGTAATGCTTATGAGGTGGTTATGAACATATCCTCAAAGTCATATGATTTATTCCAATTGTGTGCTCAAGAAAGATATATAGATTTTATGGTAAATGAGCTCAACTCGGATGATATCTTATATCAGCTGAATATTCTCGAACTTTTGTCACAATTAGCTGTAAAATCTCATGGAATAAACTATCTTGTTAAACAAGGGATGCTACAAAAAATCGCTGATCAAGTCAAAGAGTTGCATAGCAATCCATTCGGATCACTTTTAACTCCAGGTTACATGAAATTTTTTGGTTACATTGCTTACAATTATCCGAAAGAAATTTTTGGAAAATATCCGATCCTATTGGAAACACTCTTTGAAGCTTTGGATTCTAATGATTCCAATCTTTTGCCTGTTGCTATTGACACGCTGGGTTTCATTGGAACCACGAATGAAGGGAAGATGTGTTTAGCTACATTAGGCAGCAAATATACTCAGAATATTGAAAGATTGGGTAGCATTATAAGAAATAGCCCCTCTGAATTACGGATTCGAGCCCTACGTTGCATGGGAAATCTAATAAGTGTGGATAAAGATCCAAATTCAAAAGTTGAATCTGTTGATCAGAGAATAACTTTGATGACGCGCGAATGGTTCAGGATTTTGAGTAAGCAGCCGTCATCTATGGAAGTTTTATATGGCATTTGTAAGAATCCATTCCCCGATATAAAGTTGGCTGGATTGATTTTACTTGATGCTGCTTGTCAACACCAGTGGGGTGAGGAGTCGGTGGCGAGGGTAGCAGGTTTTATCGAGTATTTATTGGATCGGACAACGGATTTCAACAAGGAATGCAATGAAGCTAAATATGATATAATAAAGAGGTTAGCTTCATCTACAGCTTTTGATGAAGCCATTATCATTCGGCTCCAAAAATATGTGGAACTTGGACCGTTCCAGTCTGAAACTACTTTAGAAGTTGCTATGGATGGCGAATAA

Protein sequence:

>DPOGS202056-PA
MSNRDLDELRNCFNRLKIKENVPTALNEIKSILAYKPAAEAAPTIRNIGISKILYCINTSTKSEAELACDVLKICFDKFEPGEAIRCYISHFMYLLRHERDCVRRLAVDEVYKAITSNVNVLPLPQYIDVYVAVAQMICDVDIGIANKAVLITSNLPTEAYPKVLEEMKIALDCNNSAKCNAYEVVMNISSKSYDLFQLCAQERYIDFMVNELNSDDILYQLNILELLSQLAVKSHGINYLVKQGMLQKIADQVKELHSNPFGSLLTPGYMKFFGYIAYNYPKEIFGKYPILLETLFEALDSNDSNLLPVAIDTLGFIGTTNEGKMCLATLGSKYTQNIERLGSIIRNSPSELRIRALRCMGNLISVDKDPNSKVESVDQRITLMTREWFRILSKQPSSMEVLYGICKNPFPDIKLAGLILLDAACQHQWGEESVARVAGFIEYLLDRTTDFNKECNEAKYDIIKRLASSTAFDEAIIIRLQKYVELGPFQSETTLEVAMDGE-