Monarch geneset OGS2.0

DPOGS211166
TranscriptDPOGS211166-TA2346 bp
ProteinDPOGS211166-PA781 aa
Genomic positionDPSCF300007 + 280761-285080
RNAseq coverage181x (Rank: top 49%)
Annotation
HeliconiusHMEL0172190.080.41% 
BombyxBGIBMGA003152-TA0.076.00% 
Drosophilart-PA0.051.80% 
EBI UniRef50UniRef50_UPI00022C97B40.056.65%UPI00022C97B4 related cluster n=4 Tax=Caenorhabditis elegans RepID=UPI00022C97B4
NCBI RefSeqXP_623815.20.058.15%PREDICTED: similar to rotated abdomen CG6097-PA [Apis mellifera]
NCBI nr blastpgi|3504131450.056.65%PREDICTED: hypothetical protein LOC100749410 [Bombus impatiens]
NCBI nr blastxgi|3320258990.055.96%Protein O-mannosyltransferase 1 [Acromyrmex echinatior]
Group
Gene OntologyGO:00160201.5e-55membrane
GO:00064931.5e-55protein O-linked glycosylation
GO:00000301.5e-55mannosyltransferase activity
KEGG pathwayame:5514200.0 
 K00728 (POMT)maps-> O-Mannosyl glycan biosynthesis
InterPro domain[83-318] IPR0033421.5e-55Glycosyl transferase, family 39
[347-539] IPR0036084.7e-53MIR
[350-411] IPR0160932.2e-13MIR motif
Orthology groupMCL15389 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS211166-TA
ATGGCAAATGAAATATCTCTGAAGCAACGTAAAACTTCTAAAAAATCAAACGAGGTCAGCAAACTTTCCGAAGCTGATTTGGCGGATTCAATAAATTCCTCAATAATTGCGAAAAGAGAAGTGGGCGATGGAGCAGCAGCTGAAAATGAGGTGGAATATGATAAGAATACTGATCACAAAGATGAATTAGAAAAGCCAATACGAAGCATTTATTTCAGGATTGAAATAGATGTTGTGTTTCTATTGCTTCTTTTATTAGCTCTGGCTACAAGGCTTTATAAATTAGAAGAACCCAATTACATAGTATTTGATGAATTACATTATGGAAGATATGTGACTTTATACACAAAAGGAATATTCTTTTTTGATGCGCACCCACCTTTAGGGAAGCAATTGTTGTACTTGGCAGGACAATGGGCTGGTTATAACGGAAATTTTACTTTTGATAGAATAGGTTCCCCTTATAGTGATAGTGTACCTATAAGAGCCTTGCGTTTGGTTCCCGCTGTCTGCGGGAGTTTACTAGTACCTGTTACATACCAGTTGATGTTGGAAATATGTACATATCAATTTACTGCCATATTAGCTGCAGCTCTTGTCTTATTTGAAAATTGCTTTCTGGCGCAATCGAGGTTTATGCTGTTGGAGAGCATTCAGATTCTGTTTGGATTGTGGGGATTATTATGTATTATAAAGAGTTCTCGTAAAACCGGTGCAGCGTCAGTTATTTGGCTATGTGTTGGAGCGTTTTCGCTAGGCTGCTGTTTCTCAGTAAAATATTCAGGGTTGTACACATATTATCTCGCGACATTTTTAGTTGGCCGACAAATGTGGCAGCTTATTGGGAAAATCAAATCAATGCTAAAACTGTTCTTGTCATGTTTGTGGAGGTTTTTAATACTCATTTGTATTCCGCTGGCTGTTTACATCAGCGTTTTCTATGCACATCTAAATATGTTACCAAAAGCTGGACCACATGATAGTGTAATGACAAGTGCCTTCCAGGCATCCTTACAAGGAGGTCTTGCTAGCATTACAAGGGGGCAACCGTTGCATGTGTCACACGGCTCACAGATAACATTAAGGCACTCCCATGGTCGTACATGCTGGCTGCATTCTCACGCACACGTTTATCCTGTGCGCTATGCCGATGGTCGAGGTTCCTCCCACCAGCAGCAGGTCACGTGCTACAGTTTCAAGGACGTCAACAACTGGTGGATCGTTAAACGTCCGGAGCAATCTTCTCTGGCCGTCTCCCAGCCACCAGACGTTATACGACATGGCGATGTAGTTCAACTGCTCCATGGCATCACAAGTAGGGCGCTAAACAGTCATGACGTTGCTGCGCCAGTTTCCCCACAATCACAGGAAGTTTCCTGCTACATCGATTACAATGTATCGATGCAGGCCCAGAATTTATGGAGGGTTGATATAGTGAATCGTGAGACTGAGGAGTCCACGTGGGATAGCATACGTTCGCTGGTGCGTCTCGTGCACGTGGACTCGGGCTCCGCTCTGAGGTTCAGTGGCCGACAGCTGCCTTCGTGGGGCTTCCACCAACACGAGGTCGTCGCTGACAAGGCCATATCCCACCAGGACACCCTGTGGAATGTTGAGGAACATCGCTTTACAAAAGCTGAAGACCGTCGTGAACGTGAGAGGGAACTGGTAACAGCAGAAATGATCCCCACTACGACGCCGCGGCTCTCGTTCTGGGATAAGTTCTTAGAGCTCCAGTATAAAATGATAACACACGCGCCGGACGCGCCAGTCGGTCACATGTTCGCTAGCGAGCCGGCTGAGTGGCCGCTATTGGTGCGCTCTATCGCCTACTGGCTGTCTCCTGACTCTAATGCGCAAGTGCATTTGATTGGAAATCTAGTGACCTGGTATGCTGGTACGATCTCCGTACTATTTTACGGTGTTCTGCTCGCTGTGTACGCGATCCGACAACGTCGTGCGTACCAAGACCTGTCACCCAGGGCATCTCACAAATTCTATGAATCTGGATATATATTATTTTTAGGATACTGGCTACATTATCTACCATACTTCTTTATGGATAGAACGTTGTTCCTCCATCACTATTTACCTGCTTACATTTTTAAAATCCTTCTCTTATCCTATGTTATTGACCATGTATATTATATCTTAGGCGCCCGTGAAAATAGTAAGTCTTTTTCTCATATATTTATATTATGTGTCATCATTTGGCTCTCTTATGTTATGATAGCATTTAAAAAGTTCAGTGTTTTAAGCTATGGCAACACTGATCTGACTGAAAATGATCTTTTGAATCTCAGATGGAAAGACACTTGGGATTTCATAATACATAAAAAGACTTAA

Protein sequence:

>DPOGS211166-PA
MANEISLKQRKTSKKSNEVSKLSEADLADSINSSIIAKREVGDGAAAENEVEYDKNTDHKDELEKPIRSIYFRIEIDVVFLLLLLLALATRLYKLEEPNYIVFDELHYGRYVTLYTKGIFFFDAHPPLGKQLLYLAGQWAGYNGNFTFDRIGSPYSDSVPIRALRLVPAVCGSLLVPVTYQLMLEICTYQFTAILAAALVLFENCFLAQSRFMLLESIQILFGLWGLLCIIKSSRKTGAASVIWLCVGAFSLGCCFSVKYSGLYTYYLATFLVGRQMWQLIGKIKSMLKLFLSCLWRFLILICIPLAVYISVFYAHLNMLPKAGPHDSVMTSAFQASLQGGLASITRGQPLHVSHGSQITLRHSHGRTCWLHSHAHVYPVRYADGRGSSHQQQVTCYSFKDVNNWWIVKRPEQSSLAVSQPPDVIRHGDVVQLLHGITSRALNSHDVAAPVSPQSQEVSCYIDYNVSMQAQNLWRVDIVNRETEESTWDSIRSLVRLVHVDSGSALRFSGRQLPSWGFHQHEVVADKAISHQDTLWNVEEHRFTKAEDRRERERELVTAEMIPTTTPRLSFWDKFLELQYKMITHAPDAPVGHMFASEPAEWPLLVRSIAYWLSPDSNAQVHLIGNLVTWYAGTISVLFYGVLLAVYAIRQRRAYQDLSPRASHKFYESGYILFLGYWLHYLPYFFMDRTLFLHHYLPAYIFKILLLSYVIDHVYYILGARENSKSFSHIFILCVIIWLSYVMIAFKKFSVLSYGNTDLTENDLLNLRWKDTWDFIIHKKT-