Monarch geneset OGS2.0

DPOGS206327
TranscriptDPOGS206327-TA1416 bp
ProteinDPOGS206327-PA471 aa
Genomic positionDPSCF300082 - 154662-165824
RNAseq coverage1894x (Rank: top 7%)
Annotation
HeliconiusHMEL0027149e-6671.78% 
BombyxBGIBMGA005270-TA4e-8971.37% 
DrosophilaTrp1-PB7e-4960.00% 
EBI UniRef50UniRef50_Q16SP57e-6256.88%Putative uncharacterized protein n=3 Tax=Neoptera RepID=Q16SP5_AEDAE
NCBI RefSeqXP_318883.41e-6446.74%AGAP009788-PA [Anopheles gambiae str. PEST]
NCBI nr blastpgi|1582987063e-6346.74%AGAP009788-PA [Anopheles gambiae str. PEST]
NCBI nr blastxgi|910885618e-7150.96%PREDICTED: similar to AGAP009788-PA [Tribolium castaneum]
Group
Gene OntologyGO:00085659.8e-141protein transporter activity
GO:00160219.8e-141integral to membrane
GO:00150319.8e-141protein transport
KEGG pathwayaga:AgaP_AGAP0097884e-64 
 K12275 (SEC62)maps-> Protein processing in endoplasmic reticulum
    Protein export
InterPro domain[1-466] IPR0047289.8e-141Translocation protein Sec62
Orthology groupMCL12111 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS206327-TA
ATGGCAGACAAAAGGAAAACTAAAAAACGAAAAGAGGAATTCGGTGAACCCTCCGAATCTGAGAAGCCCACAAGCGAGGAATACGCCGTCGCTAAATGGCTAAAGGCGAACGTACCCACGAAGAAGACAAAATTCCTTAATCATCACGTAGAATACTTCACAGGCACAAGAGCAGTGGACGCGTTATTGACATCAAAATGGGCGACTGGCAAGAACCCCATTTTCACTACCAGACACGACATCACAGACTTCCTTCACCTAATGCTCTTACACAAACTCTTCCACAGAGCTAAAAAGGTGCCAGTGACAGAGCAGGAGCTTAAAGGAAAATCGAGGAAGAAAGATGTCGAAAAGACGAGTAAGAGTGGGGATGAACAAGACGAGAAGAATCAGAGTGCTTGTGAGGGAAAAGAAACCAAAGACAAAGACGGTAAAGATAAGGACAAAGAGAAGAAAAAGAGGAAGATCCGTCTGGAGATGCACATGGAGCAAGTTTTCTTAGACACTAATGACGCGTACGTCTGGCTCTACGACCCCATGCCCTGGTACTATTGGCTATGCGGAGCATTACTGCTCGTGGGCACTGTAGGGGTCTGCATGTTCCCGCTATGGCCGGCCACTGTTAGGAAGAAAGATGTCGAAAAGACTAGTAAGAGTGGGGATGAACAAGACGAGAAGAATCAGAGTGCTTGTGAGGGAAAAGAAACCAAAGACAAAGACGGCAAAGAGAAGAAAAAGAGGAAGATCCGTCTGGAGATGCACATGGAGCAAGTTTTCTTAGACACCAATGACGCGTACGTCTGGCTCTACGACCCCATGCCCTGGTACTATTGGCTATGCGGAGCATTACTGCTCGTGGGCACTGTAGGGGTTTGCATGTTCCCGCTATGGCCGGCCACTGTTAGGAAGGGTGTGTACTATCTGAGTATAGCGGCGGCTGGTTTCCTGGTACTGATCATAGCGTTGGCTGTACTCAGAGTGGTAGTGTTTTGTACTGTATGGGTCGCAACACTCGCTAGACATCATCTCTGGTTACTTCCAAATCTGACTGAAGACGTTGGATTCTTTGCCTCATTCTGGCCGCTGTATAAGTACGAATATCGCGGTCCCGGTTCGGAGAGCGATAAATCGTCTAAAAGCAAGAAGAAACGCAAGAAAGAAAAACATTCAGACGACGAGGAGGAAAAGACAGCTCTCATGAAGGAATCGGAGGCAAAGGAAGTTAAAGAGAAAAAAGTAGTCTCTGAAACGGCCGACACTACAGACAATAAGGAGGAACCACCGCAGCCTGAAACATCGGAACAGACGGACAAACTGTCGGAATCAGAATCTGAAAACAGCCAGAGGTCATCGACCGACAGAGACTTCGAGATGATAGATACAGCTGATGTAGACGAACATGCACACACACACTAA

Protein sequence:

>DPOGS206327-PA
MADKRKTKKRKEEFGEPSESEKPTSEEYAVAKWLKANVPTKKTKFLNHHVEYFTGTRAVDALLTSKWATGKNPIFTTRHDITDFLHLMLLHKLFHRAKKVPVTEQELKGKSRKKDVEKTSKSGDEQDEKNQSACEGKETKDKDGKDKDKEKKKRKIRLEMHMEQVFLDTNDAYVWLYDPMPWYYWLCGALLLVGTVGVCMFPLWPATVRKKDVEKTSKSGDEQDEKNQSACEGKETKDKDGKEKKKRKIRLEMHMEQVFLDTNDAYVWLYDPMPWYYWLCGALLLVGTVGVCMFPLWPATVRKGVYYLSIAAAGFLVLIIALAVLRVVVFCTVWVATLARHHLWLLPNLTEDVGFFASFWPLYKYEYRGPGSESDKSSKSKKKRKKEKHSDDEEEKTALMKESEAKEVKEKKVVSETADTTDNKEEPPQPETSEQTDKLSESESENSQRSSTDRDFEMIDTADVDEHAHTH-