Monarch geneset OGS2.0

DPOGS205878
TranscriptDPOGS205878-TA1779 bp
ProteinDPOGS205878-PA592 aa
Genomic positionDPSCF300339 + 84073-88617
RNAseq coverage469x (Rank: top 26%)
Annotation
HeliconiusHMEL0149970.083.92% 
BombyxBGIBMGA000120-TA0.075.26% 
DrosophilaVps26-PA1e-15177.64% 
EBI UniRef50UniRef50_B4JMX61e-15081.67%GH24707 n=3 Tax=Eukaryota RepID=B4JMX6_DROGR
NCBI RefSeqXP_972999.19e-16081.27%PREDICTED: similar to vacuolar protein sorting 26, vps26 [Tribolium castaneum]
NCBI nr blastpgi|910845412e-15881.27%PREDICTED: similar to vacuolar protein sorting 26, vps26 [Tribolium castaneum]
NCBI nr blastxgi|910845417e-15281.27%PREDICTED: similar to vacuolar protein sorting 26, vps26 [Tribolium castaneum]
Group
Gene OntologyGO:00309049.9e-133retromer complex
GO:00070349.9e-133vacuolar transport
KEGG pathway 
InterPro domain[6-281] IPR0053779.9e-133Vacuolar protein sorting-associated protein 26
[482-516] IPR0090691e-05MTCP1
Orthology groupMCL11915 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS205878-TA
ATGAGTTTCTTTGGGTTCGGACAAACCGCGGACATCGAAATTGTATTCGACGATGCTGACAAACGAAAAGTGGCCGAAGTTAAAACGGACGATGGTAAAAAAGAGAAACTGTTGCTTTATTATGATGGTGAAACTGTGTCGGGGAGGGTTAATGTGACGCTGCGGAAACCAGGATCGAAATTAGAGCACCAAGGTATCAAAGTTGAGCTTATCGGTCAGATAGAGTTGTTTTACGACAGAGGAAATCATCACGAATTTATATCGTTGGTTAAAGAACTCGCTCGTCCCGGAGATCTATTGCAGCACACCTCCTATCCGTTCGACTTTGCGAACGTTGAGAAACCCTATGAGGTGTACACAGGAGCCAATGTCAGGTTAAGGTACTTTTTACGAGCCACAATAGTAAGACGTCTTACAGACATCACTAAAGAGGTGGACATAGCCGTTCATACGTTATGCAGCTATCCCGATGTACTAAACTCTATAAAAATGGAAGTAGGCATCGAAGATTGTTTACACATAGAATTTGAGTACAACAAATCAAAATACCACCTGAAAGACGTTATAGTAGGTAAAATTTATTTCCTCCTCGTACGAATCAAGATAAAACACATGGAGATATCTATTATAAAGAAAGAAACGACAGGTTCTGGACCTAACACCTTCACAGAGAATGACACAGTCGCTAAATATGAAATAATGGACGGTGCACCAGTTAGAGGTGAAAGTATTCCTATTAGAGTATTTTTGGCTGGCTACGATCTAACTCCTACTATGAGAGACATAAACAACAAATTTTCGGTAAGATACTTTCTAAATCTTGTTCTAATGGACACAGAAGATCGCCGTTATTTCAAACAACAGGAAGTTACTCTGTGGCGGAAAAGTGACAAATCACGACTTCCGCTACACAATCCGCATCATCCTCAGAACTTAACGAATTCGCAGCACTACCAAATGGCTGTTTCCAGCGAAGAGAACTTGGCAAGAGGTATTTCCCCATCAATGCCACCAGAAAGTGCATTACAGAGATCTATTTCACCTCCAATGCCAAATGTTGATAAACATAACGGCCCCTCACAAATGGAACAAGAAGAACCAGATGTATTACCAAACAAACTGTCAAGTACTCACATCGAGAATGAGCCCGAACAGGTTGAACAAGAAAAGACGAGTGAAAAGCCCAAACTAGCCGAAAAGCCACTAGATAAACCACAACTAGAGGAAGTCCAGGAAGTGAATAATTCAGAGCATATCAAAGAAAAGCCACAGAATGTTAGCAAACCTCAAATATCAATAAAACCATCGGTTTCGGAAAAACCTATAGCCGAGAAAGTTGCCATCGCCGAGAAACCGTTATTGGCAGAGAAGCCCATACTAGAAAAGCCCACTTTGGCCGAGAAGCCAGTCCTCTCACAGACGGAAAGTGTAGAAGCAGCTACGAAAAACAACTTCCAAGAGGTCCGCTGTGATCGCGTCTTCGAGGCTATGCGACAGTGTTGTCTAAAACATAAACCCGTGTCATTAGTTTGCGAAGGTTACCGCTTGGAGCCGAGGGTTTTCGCCCCCGTGACTGATCGACCAGCTAAGGAGAAAGCTAAAAATAGAATAGCGAAATATCCACTAATATTCGCGAAATGTTCTAAACAAGGCAGCCTATACGCGAAATGTGTCCTGCTCAGAGAGGATTCTGTGAGAAAAGACGACTGTGCTAAAGAATTTAAAGAATTCAACGCTTGTTTACAAACAGCCGCAAAAGAACTCAAAACAAGAATATAA

Protein sequence:

>DPOGS205878-PA
MSFFGFGQTADIEIVFDDADKRKVAEVKTDDGKKEKLLLYYDGETVSGRVNVTLRKPGSKLEHQGIKVELIGQIELFYDRGNHHEFISLVKELARPGDLLQHTSYPFDFANVEKPYEVYTGANVRLRYFLRATIVRRLTDITKEVDIAVHTLCSYPDVLNSIKMEVGIEDCLHIEFEYNKSKYHLKDVIVGKIYFLLVRIKIKHMEISIIKKETTGSGPNTFTENDTVAKYEIMDGAPVRGESIPIRVFLAGYDLTPTMRDINNKFSVRYFLNLVLMDTEDRRYFKQQEVTLWRKSDKSRLPLHNPHHPQNLTNSQHYQMAVSSEENLARGISPSMPPESALQRSISPPMPNVDKHNGPSQMEQEEPDVLPNKLSSTHIENEPEQVEQEKTSEKPKLAEKPLDKPQLEEVQEVNNSEHIKEKPQNVSKPQISIKPSVSEKPIAEKVAIAEKPLLAEKPILEKPTLAEKPVLSQTESVEAATKNNFQEVRCDRVFEAMRQCCLKHKPVSLVCEGYRLEPRVFAPVTDRPAKEKAKNRIAKYPLIFAKCSKQGSLYAKCVLLREDSVRKDDCAKEFKEFNACLQTAAKELKTRI-