Monarch geneset OGS2.0

DPOGS202154
Transcript  DPOGS202154-TA  2037 bp
Protein  DPOGS202154-PA  678 aa
Genomic position  DPSCF300162 - 105388-110951
RNAseq coverage  1082x (Rank: top 12%)
Annotation
Heliconius  HMEL020873  3e-103  52.26%
Bombyx  BGIBMGA003428-TA  0.0  94.25%
Drosophila  CG1518-PA  0.0  78.69%
EBI UniRef50  UniRef50_Q9VRE0  0.0  78.69%  CG1518 n=99 Tax=root RepID=Q9VRE0_DROME
NCBI RefSeq  XP_001648748.1  0.0  79.85%  oligosaccharyl transferase [Aedes aegypti]
NCBI nr blastp  gi|270006562  0.0  80.83%  hypothetical protein TcasGA2_TC010433 [Tribolium castaneum]
NCBI nr blastx  gi|270006562  0.0  80.71%  hypothetical protein TcasGA2_TC010433 [Tribolium castaneum]
Group
Gene Ontology  GO:0016020  5.3e-171  membrane
  GO:0006486  5.3e-171  protein glycosylation
  GO:0004576  5.3e-171  oligosaccharyl transferase activity
KEGG pathway  aag:AaeL_AAEL004228  0.0
  K07151 (STT3)  maps-> Protein processing in endoplasmic reticulum
  N-Glycan biosynthesis
InterPro domain  [2-627]  IPR003674  5.3e-171  Oligosaccharyl transferase, STT3 subunit
Orthology group  MCL10156  Multiple-copy universal gene

Nucleotide sequence:

>DPOGS202154-TA
ATGGCAGCGATTTTATCGTTTGGAACACGTCTTTTCTCAGTTTTACGCTTCGAAAGCGTAATCCATGAGTTCGACCCCTACTTTAACTACAGAACAACGCGATATCTGACAGAAGAAGGGTTCTATAAATTCCACAATTGGTTTGATGACAGAGCATGGTACCCTCTAGGACGTATCATTGGTGGTACTATTTACCCTGGGCTGATGGTGACATCGGCCACATTATACAACATAATGCAGTTCCTCAACATAACAATAGACATCAGGAATGTATGTGTGTTCCTGGCTCCGTTCTTCTCATCGCTCACAACTATTGTCACCTACCTGCTGACTAAGGAGTTGAAGGATGAGGGTGCGGGGTTGGTCGCTGCGGCGATGATTGCCATCGTCCCCGGCTACATCAGTCGTTCGGTCGCTGGCAGTTATGACAACGAGGGTATTGCCATCTTCTGTATGCTCCTGACATATTATTTCTGGATCAAGGCCGTCAACACCGGTACTATTTTGTGGGCAACTATGACCGCATTGGCATACTTCTATATGGTGTCTTCATGGGGAGGTTACGTGTTCCTAATCAACTTGATCCCTCTCCATGTCCTGGCCCTGATCCTACTGGGGCGTTTTTCTCACCGTGTGTATGTAGCCTACAGTACCCTGTACTGTGTGGGGACGGTACTCTCTATGCAGATCAGTTTTGTTGGCTTCCAGCCGGTACAGAGCTCGGAACATATGCTTGCGTTAGGCACTTTTGGTCTATGCCAGCTGTATTCCTTCACTCAGTACCTTCGAGCTCGTCTCTCTCCAGCAAACTTTGAGCTGTTGTTCAAGGCCCTACTGACTACCCTACTGGCCACCCTCGGAACAGCACTCGTAGTGCTCACCGTGACCGGAAAAATATCACCATGGACCGGCAGATTCTATTCTCTACTGGACCCGTCCTACGCTAAAAATCACATTCCTATCATTGCATCTGTGTCCGAGCATCAGCCAACGTCGTGGTCTTCGTTCTACTTTGATCTGCAAGTGCTGGTGTTCCTCTTCCCCGCCGGCCTCTACTTCTGCTTCACCAAACTCACGGACGCCAACATCTTCATCATACTATACGGAGTACTCAGCATCTACTTTGCGGGTGTGATGGTCAGGTTGATGTTGGTGCTGGCTCCCGTCATGTGCATCGTGTCAGGAGTGGCCGCCTCCAGTCTGCTGAGCCTTCACGTTAAAGACATCGAACCTAAAGCTGAAAAACCGGACAAGAAGAAGAAACACGAAAATAATCTGGTGTTCAGGTCTGAGGTGGGGGCTCTGTTCGTGATGGTGCTCTGTGGCCTGTTGGTGTCATACGTGTTCCACTGCACCTGGGTGACGTCGGAGGCGTACTCCTCGCCCTCCATAGTGCTGTCGGCGCGGGCGCACGACGGCGCCAGGATCATATTCGACGACTTCAGGGAGGCTTACACCTGGCTCAAGATGAACACTCCAGAGGACGCGAAGGTGATGTCGTGGTGGGACTACGGCTATCAGATAACAGCGATGGCGAACAGAACGGTGATAGTCGACAACAACACTTGGAACAACACTCACATCTCGCGAGTGGGCCAGGCCATGGCGTCTAGTGAGGAACACGCCTACGAGATCATGAGGGAGCTGGACGTGGATTACGTGCTCGTTATATTCGGAGGACTCGTAGGATACTCGTCTGACGACATCAACAAGTTCCTGTGGATGGTGCGTATCGGCGGCAGTACAGACCGCGGCGCGCACATCAAGGAGGCCGACTATTACACCGGCGCGGGCGAGTTCCGCGTCGACGCACACGGGGCGCCCGCGCTGCTCAACTGCCTCATGTACAAGATGAGCTACTACAAATTCGGCTTGGTGTACACCGAGGGCGGCCGGCCTCCGGGATACGACCGCGTGCGTGGCGCCGAGATCGGCAACAAGGACTTCAATCTAGACGTACTAGAAGAAGCGTACACCACGGAGCACTGGCTGGTCCGCGT
CTACAAGGTCAAGCCGCTGCCCAACCGCGGCCTTTGA

Protein sequence:

>DPOGS202154-PA
MAAILSFGTRLFSVLRFESVIHEFDPYFNYRTTRYLTEEGFYKFHNWFDDRAWYPLGRIIGGTIYPGLMVTSATLYNIMQFLNITIDIRNVCVFLAPFFSSLTTIVTYLLTKELKDEGAGLVAAAMIAIVPGYISRSVAGSYDNEGIAIFCMLLTYYFWIKAVNTGTILWATMTALAYFYMVSSWGGYVFLINLIPLHVLALILLGRFSHRVYVAYSTLYCVGTVLSMQISFVGFQPVQSSEHMLALGTFGLCQLYSFTQYLRARLSPANFELLFKALLTTLLATLGTALVVLTVTGKISPWTGRFYSLLDPSYAKNHIPIIASVSEHQPTSWSSFYFDLQVLVFLFPAGLYFCFTKLTDANIFIILYGVLSIYFAGVMVRLMLVLAPVMCIVSGVAASSLLSLHVKDIEPKAEKPDKKKKHENNLVFRSEVGALFVMVLCGLLVSYVFHCTWVTSEAYSSPSIVLSARAHDGARIIFDDFREAYTWLKMNTPEDAKVMSWWDYGYQITAMANRTVIVDNNTWNNTHISRVGQAMASSEEHAYEIMRELDVDYVLVIFGGLVGYSSDDINKFLWMVRIGGSTDRGAHIKEADYYTGAGEFRVDAHGAPALLNCLMYKMSYYKFGLVYTEGGRPPGYDRVRGAEIGNKDFNLDVLEEAYTTEHWLVRVYKVKPLPNRGL-
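
The transcript and protein lengths listed in this record (2037 bp and 678 aa) fit a coding sequence of 678 codons plus one stop codon. The following is a minimal sketch of how that check could be scripted. It assumes the trailing "-" in the protein record marks the stop codon and that sequence lines may be wrapped. The helper names `parse_fasta` and `cds_length_matches` and the toy records are hypothetical illustrations, not part of the record or of any database tool.

```python
def parse_fasta(text):
    """Return {header: sequence} from FASTA text, joining wrapped sequence lines."""
    records = {}
    header = None
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            header = line[1:]
            records[header] = ""
        elif header is not None:
            records[header] += line
    return records


def cds_length_matches(cds_len, protein_len):
    """True if a CDS of cds_len bp encodes protein_len residues plus one stop codon."""
    return cds_len == 3 * (protein_len + 1)


# Toy records (hypothetical, not the real sequences); the CDS is wrapped over two lines.
toy = """>toy-TA
ATGGCAGCG
TGA
>toy-PA
MAA-
"""

records = parse_fasta(toy)
cds = records["toy-TA"]
protein = records["toy-PA"].rstrip("-")  # assumption: trailing "-" marks the stop codon

print(cds_length_matches(len(cds), len(protein)))
# Lengths as listed in the record header: 2037 bp transcript, 678 aa protein.
print(cds_length_matches(2037, 678))
```

The same two helpers could be pointed at the full nucleotide and protein records above by passing their text to `parse_fasta` in place of the toy string.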