Monarch geneset OGS2.0

DPOGS202680
Transcript: DPOGS202680-TA  981 bp
Protein: DPOGS202680-PA  326 aa
Genomic position: DPSCF300039 + 813240-817647
RNAseq coverage: 650x (Rank: top 20%)
Annotation
Heliconius: HMEL020653  2e-148  75.60%
Bombyx: BGIBMGA001303-TA  8e-150  83.22%
Drosophila: CG7830-PA  3e-111  57.45%
EBI UniRef50: UniRef50_E0VCH4  1e-92  52.87%  Tumor suppressor candidate, putative n=2 Tax=Pediculus humanus corporis RepID=E0VCH4_PEDHC
NCBI RefSeq: XP_001650913.1  1e-113  60.00%  hypothetical protein AaeL_AAEL005457 [Aedes aegypti]
NCBI nr blastp: gi|157110000  2e-112  60.00%  hypothetical protein AaeL_AAEL005457 [Aedes aegypti]
NCBI nr blastx: gi|91081359  3e-112  60.94%  PREDICTED: similar to CG7830 CG7830-PA [Tribolium castaneum]
Group
KEGG pathway: aag:AaeL_AAEL005457  3e-113
  K12669 (OST3, OST6) maps to:
    Protein processing in endoplasmic reticulum
    N-Glycan biosynthesis
InterPro domain:
  [5-326]  IPR006844  5.5e-138  Magnesium transporter protein 1
  [156-311]  IPR021149  4.7e-41  Oligosaccharyl transferase complex, subunit OST3/OST6
  [31-165]  IPR012336  8.3e-09  Thioredoxin-like fold
Orthology group: MCL11720  Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS202680-TA
ATGAAATACAAATCCCTATTATTTTTTACATTATTCTTAACTTATTACGATGCAAATGCTCAAAACCGTGGTAAAGGACTCGAAGAGAAAGTCCGTAATCTTAAAGACATGACTGTGAAATATTCGATGATTTCCCTAAATCTCAATAGATTCAAAGAGTACGTACGCTCTCCGCCTAGAGACTACTCCTTCGTGGTCATGTTCACAGCAATGGCACCGTCTAGAAAGTGTGCTATCTGCCAGCATGTTTACGACGAGTATACGATCGTTGCTAATTCATTCCGTTTCTCATCTGCCTACTCTGATAAACTCTTCTTTGGCATAGTTGACTTCGATGAAGGATCTGATATTTTCCAAATGTTACGTTTGAATACAGCTCCAGTGATTATGCACTTCCCCGCCAAAGGTAAACCAAAACCCGCTGATTCCATGGACTTTGAGAGAGCTGGTATTCATGCGGAGGCCATTGCTAAGTGGATCCAAGACAGAACTGATGTTCAAATCAGAATATTCCGTCCACCGAACTATTCAGGAGTCGCAGTATTCTTAGTTCTGTTCATCTTCATAGCTATTTTCCTCTATGTCCGTCGGAATAATCTTGAGTTCCTATACAATAAGCAGATGTGGGCCATTATAGCTGTGTTCATATGTTTCGCCATGGTGTCAGGTCAGATGTGGAATCAAATACGAGGCCCACCATTCTTCCACAGGACCAAGAACGGCCCCGTTTATATCAATGGTGGTTCCCATGGGCAGTTTGTCCTTGAAAGCTATATTGTCGCTATGTTAAATTGTGCTGTAGTGGTTGGTATGATATTGATGATTGAAGCTGCTGGAGGAGTGAACGGGAAAGATGTTCGAGCCCAGGAGGGGAAGAGAAGGAGGTTCTATTCTGTGGTTGGTTTAGTTCTGGTGTGCGTCTTCTTTTCTTTACTTTTATCTGTCTTCAGAGCTAAGACACAAGGATATCCTTACAGGTAA

Protein sequence:

>DPOGS202680-PA
MKYKSLLFFTLFLTYYDANAQNRGKGLEEKVRNLKDMTVKYSMISLNLNRFKEYVRSPPRDYSFVVMFTAMAPSRKCAICQHVYDEYTIVANSFRFSSAYSDKLFFGIVDFDEGSDIFQMLRLNTAPVIMHFPAKGKPKPADSMDFERAGIHAEAIAKWIQDRTDVQIRIFRPPNYSGVAVFLVLFIFIAIFLYVRRNNLEFLYNKQMWAIIAVFICFAMVSGQMWNQIRGPPFFHRTKNGPVYINGGSHGQFVLESYIVAMLNCAVVVGMILMIEAAGGVNGKDVRAQEGKRRRFYSVVGLVLVCVFFSLLLSVFRAKTQGYPYR-
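
The transcript and protein records above are related by codon translation: 981 bp corresponds to 326 codons plus one stop codon (3 x 327 = 981), and the trailing "-" in the protein sequence marks that stop. The following is a minimal sketch of how the two could be checked against each other. It assumes the standard nuclear genetic code, which this record does not state explicitly, and the function name `translate` is a hypothetical helper, not part of any tool used to build the geneset. Only the first 15 and last 18 nucleotides of DPOGS202680-TA are used, to keep the example short.

```python
from itertools import product

# Standard genetic code (an assumption; the record does not name a table).
# Codons are enumerated in TCAG order, third base varying fastest.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds, stop_symbol="-"):
    """Translate an in-frame coding sequence, codon by codon.

    Stop codons are written as `stop_symbol` ("-" matches this record).
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        protein.append(stop_symbol if aa == "*" else aa)
    return "".join(protein)


# First five codons of DPOGS202680-TA.
print(translate("ATGAAATACAAATCC"))      # MKYKS
# Last six codons of DPOGS202680-TA, including the TAA stop.
print(translate("GGATATCCTTACAGGTAA"))   # GYPYR-
```

Passing the full nucleotide sequence to the same function should yield a 327-character string (326 residues plus the stop symbol) that can be compared directly with DPOGS202680-PA.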