Monarch geneset OGS2.0

DPOGS210106
TranscriptDPOGS210106-TA1254 bp
ProteinDPOGS210106-PA417 aa
Genomic positionDPSCF300017 + 1097891-1102782
RNAseq coverage4050x (Rank: top 3%)
Annotation
HeliconiusHMEL0146230.090.08% 
BombyxBGIBMGA012687-TA0.097.69% 
DrosophilaSec61alpha-PA0.094.09% 
EBI UniRef50UniRef50_Q9H9S30.091.77%Protein transport protein Sec61 subunit alpha isoform 2 n=126 Tax=Coelomata RepID=S61A2_HUMAN
NCBI RefSeqNP_001037628.10.097.69%transport protein Sec61 alpha subunit [Bombyx mori]
NCBI nr blastpgi|1129833700.097.69%transport protein Sec61 alpha subunit [Bombyx mori]
NCBI nr blastxgi|1129833700.097.69%transport protein Sec61 alpha subunit [Bombyx mori]
Group
Gene OntologyGO:00160202.6e-92membrane
GO:00154502.6e-92P-P-bond-hydrolysis-driven protein transmembrane transporter activity
GO:00150312.6e-92protein transport
KEGG pathwaycqu:CpipJ_CPIJ0077230.0 
 K10956 (SEC61A)maps-> Phagosome
    Vibrio cholerae infection
    Protein processing in endoplasmic reticulum
    Protein export
InterPro domain[2-389] IPR0022080SecY protein
[7-389] IPR0232016.3e-136SecY subunit domain
[40-74] IPR0195614.4e-18Translocon Sec61/SecY, plug domain
Orthology groupMCL10778 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210106-TA
ATGGGAATCAAATTCCTGGAAGTTATAAAACCGTTTTGCAGTATACTGCCAGAAATAGCGAAACCGGAGAGAAAGATCCAATTCAGAGAAAAAGTATTATGGACAGCAATTACACTATTCATTTTCTTAGTATGCTGCCAGATTCCATTATTTGGTATAATGTCGTCAGACAGTGCGGATCCCTTCTATTGGATCCGTGTGATCTTGGCATCGAATCGAGGAACACTTATGGAGCTGGGTATTTCCCCCATCGTCACATCAGGACTGATCATGCAACTGCTAGCTGGAGCTAAGATTATAGAAGTCGGTGACACTCCCAAAGATAGGGCCTTGTTTAATGGGGCGCAGAAACTATTCGGCATGGTGATAACAGTGGGACAGGCCATAGTGTATGTCATGACGGGAATGTACGGTGAACCTAGTGAGATTGGTGCCGGAGTCTGTCTGCTCATCATCATACAGTTGTTTGTGGCCGGACTTATTGTACTGCTGCTCGATGAATTACTTCAGAAAGGTTATGGTCTTGGCTCCGGTATTTCCCTCTTCATTGCCACCAACATTTGTGAAACAATCGTATGGAAGGCTTTCTCACCGGCTACCGTCAACACTGGTCGCGGTACAGAGTTTGAAGGCGCAGTGATAGCATTATTCCACTTGCTGGCCACTAGACCCGATAAAGTCCGAGCACTCAGAGAAGCCTTCTACCGTCAGAATCTACCAAATTTGATGAACCTCCTAGCGACAGTCCTAGTGTTTGCTATCGTGATATACTTCCAGGGCTTCAGGGTGGATCTCCCCATCAAGTCAGCTCGTTACCGCGGCCAGCACTCTTCGTACCCCATCAAACTGTTCTACACCTCAAACATTCCAATCATTCTTCAATCCGCCCTCGTCTCCAATCTGTATGTTATCTCTCAGATGTTAGCTGTGAAGTTCAGCGGCAACTTCCTGGTGAACTTACTTGGTGTGTGGGCAGACGTGGGCGGTGGTGGCCCCGCCCGCGCCTATCCCGTGGGCGGTCTGTGCTACTACTTCAGCCCCCCGGAGTCGCTCGCCCACATCGCTCACGACCCGCTTCACGCCGTCATGTACATCATCTTCATGTTGGGCTCCTGCGCATTCTTCTCAAAGACATGGATCGATGTTTCTGGATCATCAGCTAAGGATGAACCAAACTCCTATCCCGTATGTTGGTCCCGCACCAAGGTCAGATCGTTCCATCGCTCGATAACGGAAGACGTATATAAACCGTAA

Protein sequence:

>DPOGS210106-PA
MGIKFLEVIKPFCSILPEIAKPERKIQFREKVLWTAITLFIFLVCCQIPLFGIMSSDSADPFYWIRVILASNRGTLMELGISPIVTSGLIMQLLAGAKIIEVGDTPKDRALFNGAQKLFGMVITVGQAIVYVMTGMYGEPSEIGAGVCLLIIIQLFVAGLIVLLLDELLQKGYGLGSGISLFIATNICETIVWKAFSPATVNTGRGTEFEGAVIALFHLLATRPDKVRALREAFYRQNLPNLMNLLATVLVFAIVIYFQGFRVDLPIKSARYRGQHSSYPIKLFYTSNIPIILQSALVSNLYVISQMLAVKFSGNFLVNLLGVWADVGGGGPARAYPVGGLCYYFSPPESLAHIAHDPLHAVMYIIFMLGSCAFFSKTWIDVSGSSAKDEPNSYPVCWSRTKVRSFHRSITEDVYKP-