Monarch geneset OGS2.0

DPOGS212365
TranscriptDPOGS212365-TA741 bp
ProteinDPOGS212365-PA246 aa
Genomic positionDPSCF300019 + 97635-99300
RNAseq coverage2547x (Rank: top 5%)
Annotation
HeliconiusHMEL0053094e-10989.66% 
BombyxBGIBMGA004657-TA3e-13188.62% 
DrosophilaCG30382-PA1e-9666.67% 
EBI UniRef50UniRef50_G6CIR71e-141100.00%Proteasome subunit alpha type 6-A n=7 Tax=Bilateria RepID=G6CIR7_DANPL
NCBI RefSeqNP_001040459.14e-13088.62%proteasome subunit alpha type 6-A [Bombyx mori]
NCBI nr blastpgi|1140521606e-12988.62%proteasome subunit alpha type 6-A [Bombyx mori]
NCBI nr blastxgi|1140521603e-12388.62%proteasome subunit alpha type 6-A [Bombyx mori]
Group
Gene OntologyGO:00516038.5e-51proteolysis involved in cellular protein catabolic process
GO:00042988.5e-51threonine-type endopeptidase activity
GO:00058398.5e-51proteasome core complex
GO:00041753.6e-14endopeptidase activity
GO:00197733.6e-14proteasome core complex, alpha-subunit complex
GO:00065113.6e-14ubiquitin-dependent protein catabolic process
KEGG pathwaycqu:CpipJ_CPIJ0017072e-112 
 K02730 (PSMA6)maps-> Proteasome
InterPro domain[34-220] IPR0013538.5e-51Proteasome, subunit alpha/beta
[9-31] IPR0004263.6e-14Proteasome, alpha-subunit, conserved site
Orthology groupMCL12686 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS212365-TA
ATGGCCCGTGAAAGTAGTGCTGGTTTTGATAGGCACATTACCATTTTCTCTCCTGAGGGCAGACTTTATCAAGTCGAATATGCACTGAAAGCTATCAACCAGGGTGGGCTCACTTCTGTGGCTCTGCGCGGGACGGATGCGGCCGTGGTGGCAGCGCAGCGCAAGGTGCCCGACCGACTACTCGACCCAGCATCTGTCACACATCTCTTTAGACTCACCGATAGAATTGGATGTGTTATGACTGGCATGACAGCGGACAGTCGCTCCCAGGTACAGCGTGCGCGGTACGAGGCCGCTAACTGGACCTACAAGTATGGCAGCGCCGTGCCAGCACACGTGTTGTGTCGTCGCGTGGCTGACGTGTCCCAGGTGTACACGCAGAATGCAGAGATGAGACCGCTTGGCTGCAGTATGATGTTGATCGCGTACGACGACGAGACCGGTCCGTGCGTGTACAAGACGGACCCCTCCGGGTACTACTGCTCGTACAAGGCGGTGGCAGCCGGGGCCAAGGCCACCGACGCTGGCGCCTACCTGGAGAAAAAACTCAAGAAGCGAGGCGACCTCTCCGAGGACGACGCCGTGCAGCTCGCCGTCAGCTGCCTGGCCGCCGTGCTCAGCGTGGACTTCAAAGCCGCGGAGATCGAGGTCGGAGTCGTCTCCAAGGAGCGGCCGGACTTCAGAGTGCTGACGGAGGCGGAGATCGACAGACACCTGACCGCGATCGCTGAGAAGGATTAG

Protein sequence:

>DPOGS212365-PA
MARESSAGFDRHITIFSPEGRLYQVEYALKAINQGGLTSVALRGTDAAVVAAQRKVPDRLLDPASVTHLFRLTDRIGCVMTGMTADSRSQVQRARYEAANWTYKYGSAVPAHVLCRRVADVSQVYTQNAEMRPLGCSMMLIAYDDETGPCVYKTDPSGYYCSYKAVAAGAKATDAGAYLEKKLKKRGDLSEDDAVQLAVSCLAAVLSVDFKAAEIEVGVVSKERPDFRVLTEAEIDRHLTAIAEKD-