Monarch geneset OGS2.0

DPOGS209212
TranscriptDPOGS209212-TA1878 bp
ProteinDPOGS209212-PA625 aa
Genomic positionDPSCF300061 + 1193086-1197048
RNAseq coverage497x (Rank: top 25%)
Annotation
HeliconiusHMEL0216651e-9841.46% 
BombyxBGIBMGA001982-TA6e-14248.51% 
DrosophilaPngl-PB1e-9134.84% 
EBI UniRef50UniRef50_UPI0000D569DB4e-10939.04%UPI0000D569DB related cluster n=1 Tax=unknown RepID=UPI0000D569DB
NCBI RefSeqXP_975407.17e-11039.04%PREDICTED: similar to peptide-N(4)-(N-acetyl-beta-glucosaminyl)asparagine amidase [Tribolium castaneum]
NCBI nr blastpgi|910871771e-10839.04%PREDICTED: similar to peptide-N(4)-(N-acetyl-beta-glucosaminyl)asparagine amidase [Tribolium castaneum]
NCBI nr blastxgi|910871777e-10639.39%PREDICTED: similar to peptide-N(4)-(N-acetyl-beta-glucosaminyl)asparagine amidase [Tribolium castaneum]
Group
KEGG pathwaytca:6643072e-109 
 K01456 (E3.5.1.52, NGLY1, PNG1)maps-> Protein processing in endoplasmic reticulum
InterPro domain[467-625] IPR0089792.7e-20Galactose-binding domain-like
[264-319] IPR0029311.9e-16Transglutaminase-like
Orthology groupMCL14835 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS209212-TA
ATGGAGGATATGGCACGTTTAGCGCTGGTTGAGCAGAGCGTCAAAGATGAAAATCAATATAGAAAAATACTCTTCGATCTTTTGGACCACATAAGCAATATTTTAGATAATCCCCACGATTACGATTTGAGGACTATTAAGAGTGATATCCTAGAAAAAGTGTTAGACTGCGAGGCCTTCGCCGACTACCTGAAATACATTGGATTTCAAATGGGACAAAAAGAAATTATGTTTCCCAGAGAACAAACTTTAAGTAAATTGAGAATAGCACAAGCTGCTCTGGAAAGAAAAATAGGTTTTTGTTGTGGCAATCTTAACAAGACTGTTACTGTAACAGATAACAAGAAGACCAGTAAACCAAAACTTACAGAAGCCAATATATTGGTAACCAATAATTCATTTTTATTAAGAATCCAGTCAGTTTTCAATGACATGATAAGGTATGAGGATGAAGAGCTCCAACAGAGAGCCAGGGAACACATCCCACTTGTGACGCTGCAGCTCATGGCGCTGGACAGGATGAGGGAGCATCAGAAGAAAATTAAAATGGGTGAGATCAAAGATCAAGACATGTCTTTCGACATGGCCTTGTTAATGGAACTGATTGTGTGGTTCAAAAACAAGTTCTTCACATGGGTCGATCAACCAGCCTGTGACAAGTGTGGCGGAGCGACATCATTAGTCAGTGTTAGCTCCGTCAAGACTGACTTAGAAACTTGTAGAGCTGAGCTATATAAATGCTCGTGCGGCCGCGACGTGCTCTTCCCGAGGTACAACAATCCCATCACCTTACTGAGAACGAGACGCGGCCGCTGCGGGGAGTGGGCTAACTGCTTCACGTTGATGTGTCGTGCGCTCGGATACGACACAAGATACGTGTACGACACGACCGATCACGTGTGGTGCGAGGTGTTCGACCAGGACTCCCAGCGCTGGTTGCACGTGGACCCCTGCGAGGGGTGTCTGAACGCTCCCCTGATGTACGAGCACGGTTGGGGGAAGTCCCTCACATACATCATAGCGGTCTCACGAGATGACCTCCAGGACGTAACGTGGAGATATTCCAGCCACCATAAAGCGCTGCTGCAGCGTCGCGATGAAGTGTCCGAAGCGGACCTGGTGCTAGCTATCCTGGCTCTCCGTGATCATCGTCACGACCAGGCGAGTGAGCTAGTCCAATTAGCTACATTGAGTTATGTTTTGAGTGTGAGTCCGGCCAGAAGGAGGTACCTCGTCATCAGGACATTGAAGGAACTTGTGGAATTGATGGTGGAGAGGAAACCCGGCGAAATGGAATCCCACGGCCGGATATCCGGTTCGAAGGCGTGGCGGATGGAGCGCGGTGAGACGGGAGCGAGGAAACACGCGTTCATACTAACAGAACCCGGCGACCATTGTGTACAATACCGTACCAGCTCAGATACATATAGAGTACTACTGAATAACGTACAACGAGACGAGATAAAGAGCTGGAGGGGGGGAGTGTTCCACAGCGAGAACATGTTCAGGAAGTTGGAGACGGACTGGCAGCAGACCTACCTCGCCAGGGAGGAGGGAGAAAACACCGGCAGTATATCGTGGAAGCTGATGGTGGAAGGAGATTTGGTGATATCGAGCGTAGCGATGGACGTCACCACCGCTCAGTACGAGGATGGCAGGATAGAGTGGACCTACGAGGTGGACCACCAGCCCCCGAGGACCTTCAGCTTGAACGCGGGTCGGTGGCAGGTAGACGGTAGGTGGTCATCGTGTGAGGTGAGAGCTCGTCTGGTCGGGGGGAAGGGGGTGGTAGCCTGGCAACACGCGCAGATCGCCCGACAGCACACCTCCGACGACAAACCAGCGCTCAGCCTGCTAGCCACCGTCGTGCCACGGTGA

Protein sequence:

>DPOGS209212-PA
MEDMARLALVEQSVKDENQYRKILFDLLDHISNILDNPHDYDLRTIKSDILEKVLDCEAFADYLKYIGFQMGQKEIMFPREQTLSKLRIAQAALERKIGFCCGNLNKTVTVTDNKKTSKPKLTEANILVTNNSFLLRIQSVFNDMIRYEDEELQQRAREHIPLVTLQLMALDRMREHQKKIKMGEIKDQDMSFDMALLMELIVWFKNKFFTWVDQPACDKCGGATSLVSVSSVKTDLETCRAELYKCSCGRDVLFPRYNNPITLLRTRRGRCGEWANCFTLMCRALGYDTRYVYDTTDHVWCEVFDQDSQRWLHVDPCEGCLNAPLMYEHGWGKSLTYIIAVSRDDLQDVTWRYSSHHKALLQRRDEVSEADLVLAILALRDHRHDQASELVQLATLSYVLSVSPARRRYLVIRTLKELVELMVERKPGEMESHGRISGSKAWRMERGETGARKHAFILTEPGDHCVQYRTSSDTYRVLLNNVQRDEIKSWRGGVFHSENMFRKLETDWQQTYLAREEGENTGSISWKLMVEGDLVISSVAMDVTTAQYEDGRIEWTYEVDHQPPRTFSLNAGRWQVDGRWSSCEVRARLVGGKGVVAWQHAQIARQHTSDDKPALSLLATVVPR-