Monarch geneset OGS2.0

DPOGS210285
Transcript: DPOGS210285-TA (2592 bp)
Protein: DPOGS210285-PA (863 aa)
Genomic position: DPSCF300216 + 260637-267844
RNAseq coverage: 501x (Rank: top 25%)
Annotation
Database        Hit                E-value   Identity   Description
Heliconius      HMEL003521         0.0       75.23%
Bombyx          BGIBMGA002273-TA   0.0       74.32%
Drosophila      wfs1-PB            1e-133    34.56%
EBI UniRef50    UniRef50_D6X0P3    1e-171    39.31%     Putative uncharacterized protein n=1 Tax=Tribolium castaneum RepID=D6X0P3_TRICA
NCBI RefSeq     XP_973480.1        2e-172    39.31%     PREDICTED: similar to AGAP004409-PA [Tribolium castaneum]
NCBI nr blastp  gi|91089953        4e-171    39.31%     PREDICTED: similar to AGAP004409-PA [Tribolium castaneum]
NCBI nr blastx  gi|66525150        5e-164    39.06%     PREDICTED: wolframin [Apis mellifera]
Group
KEGG pathway: tca:662278 (E-value 6e-172)
  K14020 (WFS1) maps-> Protein processing in endoplasmic reticulum
Orthology group: MCL14470 (Single-copy universal gene)

Nucleotide sequence:

>DPOGS210285-TA
ATGCCATCGGGACGGAAACGATGGAATTTGCATGATGGACCTCAGGGCTCGTTGCGACGTCTCCGCAATCAGCTCGCCGAGGACGGTTGTGCAGAATCACAAGTAGTTCTTGCCAAACAATTGTTAGAAGAAAAGTGTGAACTAGAAGCGGATAAGATATTATATATATGTATATATATATATATATATAAATACATCTGTATTCGGAGTGGTGTTATAGACGAAGACAGCGCTCCAATAGTACGGGCCAAGTCATGTTTAGCAGCGAGCCGTCAAGAGACGGTTGCAAGGAAAGCTGCCAGGGATCTGTTTGCGAGTTTATCGAACGGCGAGCAATATATAACAACGGCTCAGTTGGAGAGGCGCATAAGGGAGATATGCGCCACCACTCTCAGGAAGGGCCCCGGCGAGGACTCACTGGACGATGAGACGCCAGAAGAAGAGAGGGCGTTGGATGAACAAGCCTTAGAACCTTCCAAACTACAAGCTGGTGACCACCTCCATACCAACGGTACCATACGGACTCACGATGTGGATGAAGATTATCCTGAAAGATGCCAGAATGATGACATACGGAACCTAACTGTGGATAATTTAGTATCAGCTGCGGTAGATTACTGTCAGGGAGAGTTGCCGTTAGTTAGCTACGAATTGACTTTAACGGACCCAAGCGTGAAGGCTCTAGATCACATTCCCATATTACACCAGGCTTTTCTGCATCCCATAGTCTTTTTGCAGGTTTTGTATCTGAAATTGCTCTACTACTTTGGGTCCTTCTCCTTCAGACTGTCGAATATCGAATTGTCAGTCCTCCTAATATGTTATTTGTCCGTAAGCACGGACAGTTTGTATCATCTAGTCCCGCTAGTTCTCTATTACGCCAGCATAATCGCTATGGTTATTTGTACATTCAAGATGCTGCTGGCCAAACGCCAGTTTATAGATTTCCGGAAGTGGTCGGGTCTGTTCTTGCGGTACAGCGACGGTAACTTGCAGCCGGACGAGTCGGAAGATCTGTTTGTCCGGAACAACCTGGCGCCGTTTGTCCAATTCTTCCTGGCGTTGTTCGTGAACCTGTTCTTGTATCCATTCATCGCTACCCAATGGGTGCCGTTCTCTGAATTCTGTGTTTTGTCCTTCTGTCTAATGTTTCTGACGCTATTGTCGTTTGGCACGAACGGGAGTCCTTACCCGGACGTGTTGGCTCTGATCTCCTTCGGTATAAACGTTCTCGCCAAATATCCTTACGAGAAGGATACTGTTGTACATCAAGGTTGGAGGTTCTTGGATTTGCACATATCCAATTATCCGTCTTATATACTTGGGAATAGCATAGAATTCTGCTTGAACGCCCGAGTCTTCTTCTCTCTACTGATACCGATAATCCTAGCCGTGATGGCGAAGAGGAATAACTGGCAGGGCGTATTGAAGTACACCCTACCACACTGCGTGACCTTGAGCTGGTTGCAGATGTTCATAACGTGTTCTCACGGCTGTACGACCTATGGTTTGATAAGGGGAACCCTGGCTTTAGTTTGTACGTTTCTCTTCTTACCCCTAATGGGCATAGTGACAGTGACCTTACCAATAATAGCCTTCCTTCAATACGTCACCATGTCCAAACTATTGTATACATTGACGGTCCTAATTTGGCTGGTCGTAGGTTTAACTGTCACGTGTTTCCTTGCGAAATCCGAAGCCACAAAGAAGTTTGTCACTCCATTTCAGATCGCTATAGGTATGATAACGTTAATATACACCGGAAACCAATTCGCTGTGAACATCCAGGAGGACGGCATCGCGTCTGGGCTCATAGAGATCATCGGAGATGAAAAATCCAGCATCAAAAACCTGCTGAAGAACGAGTTTATGACAGACTACGAGGATAGCTTGTATCACATCAACTGGGACGATTATTACAACCAATGTAACACGCCATCTTGGAACGAGAGGAATATGGCAACCACTCAGATGAAGTGTTCCGTACTGGACGGAGCTCA
CGTGAACTGGGAGGGTTATGTTAAAGATGTCAGAGTGAAAAGTGTCAGGAATCAATGGAACACTGTCGCTGGCTGGTTGCCCCAAATAATGTCGGAGTATTTCAAATGTTACTACGGCGAGGAGTTTTCCAGTTTGTGCCGTAACGACGTGGCCGATTGCGAATTTGTGCAGACTGTGGCTGAAGAGAGCGGCAAGAGCTGCCATTTGAATAATTTTAATGAATACACATACGAAATAACAGTGAATATGGAAGCGAATGGCGGCCTATTGAAACGTCACTCAGAAATCATTTTGACGTTTGACAATTTATTTACAAACTTTACGCGCCTTTTACGTTCGGACGATAAGATCAATTTTAAAGGTGTTTTACTGAATGATCACGATTCATATAATATTGGTCATAGAAATCTAAATATTAAAGGTTACGAATTAAAATGCATTGAGTGTAAAGAAGCTAGGGGTGCTGTCAGCTCGAGGACGCCGTCTCCATCGAAGCTGTCAGAACTTTTTCGGACAATTGTTAATGATTGCGTCGTTTCAGCTAAATACATATTGAATTTCTTATTAAATCCCATTGTTGTTATCAAATAA

Protein sequence:

>DPOGS210285-PA
MPSGRKRWNLHDGPQGSLRRLRNQLAEDGCAESQVVLAKQLLEEKCELEADKILYICIYIYIYKYICIRSGVIDEDSAPIVRAKSCLAASRQETVARKAARDLFASLSNGEQYITTAQLERRIREICATTLRKGPGEDSLDDETPEEERALDEQALEPSKLQAGDHLHTNGTIRTHDVDEDYPERCQNDDIRNLTVDNLVSAAVDYCQGELPLVSYELTLTDPSVKALDHIPILHQAFLHPIVFLQVLYLKLLYYFGSFSFRLSNIELSVLLICYLSVSTDSLYHLVPLVLYYASIIAMVICTFKMLLAKRQFIDFRKWSGLFLRYSDGNLQPDESEDLFVRNNLAPFVQFFLALFVNLFLYPFIATQWVPFSEFCVLSFCLMFLTLLSFGTNGSPYPDVLALISFGINVLAKYPYEKDTVVHQGWRFLDLHISNYPSYILGNSIEFCLNARVFFSLLIPIILAVMAKRNNWQGVLKYTLPHCVTLSWLQMFITCSHGCTTYGLIRGTLALVCTFLFLPLMGIVTVTLPIIAFLQYVTMSKLLYTLTVLIWLVVGLTVTCFLAKSEATKKFVTPFQIAIGMITLIYTGNQFAVNIQEDGIASGLIEIIGDEKSSIKNLLKNEFMTDYEDSLYHINWDDYYNQCNTPSWNERNMATTQMKCSVLDGAHVNWEGYVKDVRVKSVRNQWNTVAGWLPQIMSEYFKCYYGEEFSSLCRNDVADCEFVQTVAEESGKSCHLNNFNEYTYEITVNMEANGGLLKRHSEIILTFDNLFTNFTRLLRSDDKINFKGVLLNDHDSYNIGHRNLNIKGYELKCIECKEARGAVSSRTPSPSKLSELFRTIVNDCVVSAKYILNFLLNPIVVIK-
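
The lengths reported in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: the variable and function names are hypothetical, the numbers are copied from the record's own fields, and it assumes the transcript is a pure coding sequence that includes the stop codon and that the genomic coordinates are 1-based and inclusive. Neither assumption is stated by the record itself.

```python
# Values copied from the record; names here are illustrative, not part of any database API.
TRANSCRIPT_BP = 2592          # DPOGS210285-TA length
PROTEIN_AA = 863              # DPOGS210285-PA length
GENOMIC_START = 260637        # on scaffold DPSCF300216, + strand
GENOMIC_END = 267844


def expected_cds_length(aa_count: int, include_stop: bool = True) -> int:
    """Nucleotides needed to encode aa_count residues, optionally plus one stop codon."""
    codons = aa_count + (1 if include_stop else 0)
    return 3 * codons


def span_length(start: int, end: int) -> int:
    """Length of a span, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


cds_bp = expected_cds_length(PROTEIN_AA)
genomic_bp = span_length(GENOMIC_START, GENOMIC_END)

print("CDS length matches transcript:", cds_bp == TRANSCRIPT_BP)
print("Genomic span (bp):", genomic_bp)
print("Genomic span minus transcript (bp):", genomic_bp - TRANSCRIPT_BP)
```

Under these assumptions, 863 codons plus a stop codon account for the full 2592 bp transcript. The genomic span is longer than the transcript, which would be consistent with intronic sequence, although the record does not list the exon structure.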