Monarch geneset OGS2.0

DPOGS208223
TranscriptDPOGS208223-TA1776 bp
ProteinDPOGS208223-PA591 aa
Genomic positionDPSCF300079 - 625949-633895
RNAseq coverage245x (Rank: top 42%)
Annotation
HeliconiusHMEL0079147e-12186.75% 
BombyxBGIBMGA006411-TA3e-16275.22% 
DrosophilaVps16A-PA3e-7238.72% 
EBI UniRef50UniRef50_D7GXI53e-9947.51%Putative uncharacterized protein n=2 Tax=Tribolium castaneum RepID=D7GXI5_TRICA
NCBI RefSeqXP_971196.15e-10047.51%PREDICTED: similar to vacuolar protein sorting vps16 [Tribolium castaneum]
NCBI nr blastpgi|910952471e-9847.51%PREDICTED: similar to vacuolar protein sorting vps16 [Tribolium castaneum]
NCBI nr blastxgi|910952477e-9447.51%PREDICTED: similar to vacuolar protein sorting vps16 [Tribolium castaneum]
Group
Gene OntologyGO:00068863.3e-83intracellular protein transport
GO:00057373.3e-83cytoplasm
GO:00055152.1e-07protein binding
KEGG pathway 
InterPro domain[6-358] IPR0069263.3e-83Vps16, N-terminal
[353-586] IPR0069254e-68Vps16, C-terminal
[41-295] IPR0110462.1e-07WD40 repeat-like-containing domain
Orthology groupMCL15037 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS208223-TA
ATGTCAGCTTTATTAACAGCAGATTGGTTTCATCTAGATTCCTATTACAGGAAATTTGATTTGTATAACATGGTATGGTCAATGGACGAAGGCCTTGGCAACATGATTGTTAGTGGAGCACAGTATGGAGGTCCTATTGCTGTTGTTAGAGACAGAAAACAGTTTGTAAGGATTGTGACGACAGCTAAGCCCGTCATCACTATATATAACTGTGTTGGGAACATTATATCAAAGATTCTGTGGAACAATGGAGTTTTAATACACATGGGCTGGTCGGACGGTGAACAGCTTCTATGTATTCAAGAGAGTGGTGATGTTTTGATATATGATATGTTTGGGGCATACCAAAAGACATTCAGTTTAGGGCAAGAAGTTAGAGATACAAAGGTCTGTAAAGCCCAACTATTTTCAAATCCCCACGGTAAAGGCCTCGCTGTCATAACAACAACAAACCGGATATTCTTATTGAGTAATGTATCTGAACCTAAAATAAGAACAGTGCCTGAAATACCTAGAGCGAATGAACCGATCAGCTGTTGGTGTGTCATCAGTTCTGACCCACGAGTTGCACCAAATATTGCTGTACTCGTATGTAGGGATAAGGAAGTGTATAGATGTCAGCTCGGTGAATCCAGACCTATACTTATGCGACCGGATATAACAAATGCCTTTACCCTAATTCTAACAATAGTGCCATCAGATAACGGTCGTCATGTAGCACTGTTCACTGACTCTGGTTTCCTCTGGCTCGGATCATCAGACTTCAAGAGTAAATACTGTGAGATTGACACAGAGTATATAAAACAACCCAAAGAGCTGCTATGGTGTGGATCCCAAGCAGTGATAGCACATTGGGATGACACAATGTGTATATACGGCCTTAATGGGACATCCGTGAAGTACCCCTACGACGGACCCTTCCACCTCATACAAGAAAGTGACTGTGTGAGAGTTATATCGGAAATGACACACGAGTTGATACAAAAAGTACCAGTAGTAGTGGAAAAGATATTCAGGATCAATAGCACAGCGCCCGGATCGTATTTGGTTGAAGCTTCGAAACAATTCCAGCTGACGATAGTTAACTTCAAACTGGCTCATTCTCTGTACATAAAGTACTGCGCGAGTCACAACAGAGAGGCTCTGCGGAAGGTTTACGTGCAGGAGGACGATTTCCACGGACAGGCGACCACACACGTCAGGGACGCCATCGAACAGAGCAATCCCGGAAGTGTAGAGGCATCCCTGATATCCGCTCGGGAATGTTACAGGAAGGGCAGGAACGATCTTGGAGTTTCGATATGCGAAGAAGCTCGCAAGCTCTGCAAGCAGCAATCAAGTCTGCAAGAGACGTATGGAGAGAGTTTTGTTGGACTCTCACTTCACGACACCGTGAAGAAACTTCTAGAACAGGGGGAAATAAAATTGGCTGATAAACTGAGATCCGAATACAGGATGCCGGATAAAAGGTACTGGTGGCTGAGAATATTAGTGATGGCGGATAACTATAAATGGGACGATTTAGAAAAGTTCTCCAAATCTAAGAAATCACCGTGCGGCTATGAACCGTTCGTGGACGCTTGTCTCAAGTACGGGAAGAACGACGAGGCACTGAAATATCTATCGAAGTGCCGGGACGACATCAAAGTCAAATATTACGTCAAGGCCGAATTCTATGAAGAAGCAGCTCAAGTAGCATTCGAACAAAAGGACGAGAGCGCGCTTTTGTTCGTACAAAATAAATGTCCTTCGTGTAATCTTAATAGCAGCCTGTAG

Protein sequence:

>DPOGS208223-PA
MSALLTADWFHLDSYYRKFDLYNMVWSMDEGLGNMIVSGAQYGGPIAVVRDRKQFVRIVTTAKPVITIYNCVGNIISKILWNNGVLIHMGWSDGEQLLCIQESGDVLIYDMFGAYQKTFSLGQEVRDTKVCKAQLFSNPHGKGLAVITTTNRIFLLSNVSEPKIRTVPEIPRANEPISCWCVISSDPRVAPNIAVLVCRDKEVYRCQLGESRPILMRPDITNAFTLILTIVPSDNGRHVALFTDSGFLWLGSSDFKSKYCEIDTEYIKQPKELLWCGSQAVIAHWDDTMCIYGLNGTSVKYPYDGPFHLIQESDCVRVISEMTHELIQKVPVVVEKIFRINSTAPGSYLVEASKQFQLTIVNFKLAHSLYIKYCASHNREALRKVYVQEDDFHGQATTHVRDAIEQSNPGSVEASLISARECYRKGRNDLGVSICEEARKLCKQQSSLQETYGESFVGLSLHDTVKKLLEQGEIKLADKLRSEYRMPDKRYWWLRILVMADNYKWDDLEKFSKSKKSPCGYEPFVDACLKYGKNDEALKYLSKCRDDIKVKYYVKAEFYEEAAQVAFEQKDESALLFVQNKCPSCNLNSSL-