Monarch geneset OGS2.0

DPOGS213220
TranscriptDPOGS213220-TA2430 bp
ProteinDPOGS213220-PA809 aa
Genomic positionDPSCF300114 + 470503-476585
RNAseq coverage494x (Rank: top 25%)
Annotation
HeliconiusHMEL0170810.071.03% 
BombyxBGIBMGA007414-TA9e-11366.87% 
Drosophilap115-PA0.056.66% 
EBI UniRef50UniRef50_E3WNX50.055.07%Putative uncharacterized protein n=1 Tax=Anopheles darlingi RepID=E3WNX5_ANODA
NCBI RefSeqXP_972184.10.059.65%PREDICTED: similar to vesicle docking protein P115 [Tribolium castaneum]
NCBI nr blastpgi|910885290.059.65%PREDICTED: similar to vesicle docking protein P115 [Tribolium castaneum]
NCBI nr blastxgi|910885290.059.65%PREDICTED: similar to vesicle docking protein P115 [Tribolium castaneum]
Group
Gene OntologyGO:00481930Golgi vesicle transport
GO:00068866.6e-68intracellular protein transport
GO:00482806.6e-68vesicle fusion with Golgi apparatus
GO:00057376.6e-68cytoplasm
GO:00001396.6e-68Golgi membrane
GO:00054881.1e-17binding
GO:00160203.7e-08membrane
GO:00085653.7e-08protein transporter activity
KEGG pathway 
InterPro domain[3-781] IPR0240950Vesicle tethering protein p115-like
[360-625] IPR0069536.6e-68Vesicle tethering protein Uso1/P115-like , head domain
[27-325] IPR0160241.1e-17Armadillo-type fold
[682-783] IPR0069553.7e-08Uso1/p115-like vesicle tethering protein, C-terminal
[16-210] IPR0119897.7e-06Armadillo-like helical
Orthology groupMCL13721 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS213220-TA
ATGGATTTTCTTAAAAGCGGTCTCAAAACAGTCCTCGGAACACCGGAGTCTGGTCAACAACCATCGGTCGCTGAAACGGTTGAGCGTTTGGTAGAAAGAGCAAGTAATTCCACCCTCCTGGAAGATCGTAGAGATGCTTGCCGGGCATTGAAAGCTATGTCCAGGAAATACAGGCTTGAAGTTGGTGCTCAAGGGTTAGATACTCTGAGACAGATTTTAGAATTAGACAGAGCAGACAACGAAACTGTGAACTATGCGTTGGACACGTTAAACAATATTGTGTCTCCGGCTCAATTTGAGGAAGAAGAGGATAAACCTCACATACCAATGAACATCGGCGACCAGTTCACGGAAATGTTCATCAAAGACCCGCACAATATACAATTAGTGTTGGATCTTCTCGACGAATATGACTTCCGAGTTAGGTTGTCTGCGGTGCAGCTTCTCGTGTCCGTGCTCACAAACAGAACCAAGGACATCCAAGAGATCATTCTAGATAAGCCAATGGGTGTGTCCAAAATGATGGATCTCTTAGCAGACACGAGGGAAGTCATCCGAAACGAAACACTGTTGTTGCTGATCAAATTGACGAAGGGCAATGCGAACATACAGAAGATTGTGGCCTTCGAGAACGCCTTCGACAGACTGTTCGAGATAGTGACCAGCGAAGGATATTCAGATGGGGGGATCATCGTGGAAGACTGTCTGTTGCTGATGTTGAATCTGTTAAAGAATAACAGCAGCAATATAAATTTTTTCAAAGAGGGTAGTTACATACAGAAGATGTTGCCGATGTTCAACATTCCGGAGAACTCTGAAGAGGTCGGCTGGTCGCCACAGAAAGTTGTTAACGTGCATTGTATGCTGCAGCTGGTGAGGACTCTGGTGTCCCCCAGCAACTCTATCCAGATTATATCGAGTTGTCAGAAAATTATGAAAAATGTCGGTCTCTTGGATGCGCTATGTAACATCCTCATGGCGAGTGGAGTGCCAGCGGACATACTGACGGAGACGATCAACACTGTTGGTGAAGTGGTCAGAGGCGACGCCACTAACCAGGACTTCATAGGAAACGTCATCGCACCGTCGTATCCCCCGCGGCCGGCGATTATAGTGCTCTTGATGTCTATGGTCAACGAGAAACAGCCTTTTGCATTGAGATGCGCCGTGCTGTATTGCTTCCAATGTTATCTATATCACAACGAGAGCAGCCAGTCTAACTTGGTCCAGACGTTGCTGCCGTCGTCGTCGGACGTGTCGAGCCTGACGAGCGGTCAGATCCTCTGCGGGGGTCTGTTCTCGTCGGACGTGCTGTCGAACTGGTTTTCAGCCGTGGCCTTGAAGCACGCGCTCATCGATAACCCCACGCAGAAGGAACAGGCGCTAAGAGTGTTACTCGCTACCAACATAGGCAGCACGCCGGTGTCGCTGCTCCACCAATGTACTCTGCTGCTGCAGCAAACCACCAAGCTACAGTCCAAGGTAGCTCTCCTCATGTTGCTGTCGACGTGGACGGCGGGCTGTTCCGGGGCCGTGGCGGCGTTCCTGGCGGCCCCGGGCGGAGTCCCGCTGCTGGTGCATCACGCCGGGAGCAACGAACACGACGACAACGAGTACCTGCTGCAAGGTCTGTCAGCGTTCCTGCTGGCTATATGTATCCACTTCAACGACGACTCGGTGGCCACCTACAGCAGGGACGCCCTCAAGCAGTTGCTGGTGAAGCGGATCGGCATGGAGACGTTCGTGGCCAAGCTGAGCGACGTGTCCAAACACGAACTCTACAACAGGGCCGCCAAACATCCGCAACTGAGAGTCGCCTCGCCCTCAGACGTGCTCATCGACTACGAGTTCTGCAAACTCTTCAAGAGTCTAGAAGGTCTGGAAGGCCCCCTAATGGAGAGCGTGTCTATGAGTCCGGGCAGGTCGTCCCCGGAGCCCCAGCTGGAGCAGTACAAGTCCCTGATCCGTCAACAAGACGCACGCCTCCAGGAGCTGGTGCAGCAGCTGGAGACGCTCACGGCACACGCACAAAACCTCCAGGGCGCTCTGAACGAGGCTCAGTCAGCCAACTCCCTGCTCAGAGACGAGAACACACTGCTCAAGGCGCAGGTCGGGAACTCCGGCTCGGACCACGAGGACAGGATACGACAGCTGACCGGGGAGGTGGCCAGGCTCAAGGAGGAGTTGGAGGGAGTCAGGAGGAGTCACAGCGCCAGGGACGAGGAGCTGGAGAAGATGAAGAGGGACCAGAACGATCTGTTGGAGCTACTGGACGATCAGGATTTGAAATTGACGGAATACAAAATGAGGTTAATAAATCTCGGCCAATCTATAGACGAAGACAATGTCGTCGAGCCCGAAGATAACCCGAGTCGTGTAAATAACAAAACAGTCGACGGAACAGATTACATCGTGACCCCGCCTTTATAG

Protein sequence:

>DPOGS213220-PA
MDFLKSGLKTVLGTPESGQQPSVAETVERLVERASNSTLLEDRRDACRALKAMSRKYRLEVGAQGLDTLRQILELDRADNETVNYALDTLNNIVSPAQFEEEEDKPHIPMNIGDQFTEMFIKDPHNIQLVLDLLDEYDFRVRLSAVQLLVSVLTNRTKDIQEIILDKPMGVSKMMDLLADTREVIRNETLLLLIKLTKGNANIQKIVAFENAFDRLFEIVTSEGYSDGGIIVEDCLLLMLNLLKNNSSNINFFKEGSYIQKMLPMFNIPENSEEVGWSPQKVVNVHCMLQLVRTLVSPSNSIQIISSCQKIMKNVGLLDALCNILMASGVPADILTETINTVGEVVRGDATNQDFIGNVIAPSYPPRPAIIVLLMSMVNEKQPFALRCAVLYCFQCYLYHNESSQSNLVQTLLPSSSDVSSLTSGQILCGGLFSSDVLSNWFSAVALKHALIDNPTQKEQALRVLLATNIGSTPVSLLHQCTLLLQQTTKLQSKVALLMLLSTWTAGCSGAVAAFLAAPGGVPLLVHHAGSNEHDDNEYLLQGLSAFLLAICIHFNDDSVATYSRDALKQLLVKRIGMETFVAKLSDVSKHELYNRAAKHPQLRVASPSDVLIDYEFCKLFKSLEGLEGPLMESVSMSPGRSSPEPQLEQYKSLIRQQDARLQELVQQLETLTAHAQNLQGALNEAQSANSLLRDENTLLKAQVGNSGSDHEDRIRQLTGEVARLKEELEGVRRSHSARDEELEKMKRDQNDLLELLDDQDLKLTEYKMRLINLGQSIDEDNVVEPEDNPSRVNNKTVDGTDYIVTPPL-