Monarch geneset OGS2.0

DPOGS212213
TranscriptDPOGS212213-TA1203 bp
ProteinDPOGS212213-PA400 aa
Genomic positionDPSCF300323 + 267138-269005
RNAseq coverage334x (Rank: top 35%)
Annotation
HeliconiusHMEL0068320.084.83% 
BombyxBGIBMGA001159-TA0.083.21% 
DrosophilaVps36-PA1e-11351.23% 
EBI UniRef50UniRef50_Q9VU871e-11151.23%Vacuolar protein-sorting-associated protein 36 n=24 Tax=Coelomata RepID=VPS36_DROME
NCBI RefSeqXP_001655431.12e-12855.67%hypothetical protein AaeL_AAEL011556 [Aedes aegypti]
NCBI nr blastpgi|1571296364e-12755.67%hypothetical protein AaeL_AAEL011556 [Aedes aegypti]
NCBI nr blastxgi|1571296361e-12155.67%hypothetical protein AaeL_AAEL011556 [Aedes aegypti]
Group
KEGG pathwayaag:AaeL_AAEL0115566e-128 
 K12190 (VPS36, EAP45)maps-> Endocytosis
InterPro domain[169-383] IPR0072861.2e-54EAP30
[3-92] IPR0216481.1e-15Vacuolar protein sorting protein, Vps36
Orthology groupMCL13173 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS212213-TA
ATGGATAGATTTGAATACATAGAAGCTAGACTATTTGAAGGAGAAACTTATCTCAAACGAGATAAAAATGTGAAAATCTATGATGGTGAAGATAAGACCCAATTTGTAGATGGAGAAATTATTCTAACAACTCACAGAATACTGTGGGGTAAACCTGGTGATATACCCAAGGGGCTTGTTTGTTTATCACTCCACCTATATTATATATTCTGTGTAGAAGAGGAGAGCGGAGGTGTTTTTGGTCTTGGAGGTCCAAAACGAATCATATTGCACCTTGGTCCAGCACTTCCAGGTAAAAGACCAGGACCGGCTGTAGTGAGCCCATTTCACTTCATAAAATTTTCATTTAAAGATGGAATTGATTCTGTGTTTTATAAGGCATTAAACGATGCCGTCGCTGCGAAAGCTTGGCAGATAGAGACACCTAATAATTCTAATTTAACTTCACCTACAAGTGTTACGCCTAAGACTTCAACATCACCTATAAATTCTAAAATTCGTTCTGGTATTGTTGGTATCGAGAGGAGTATTGAAGAGCAACACAAAGCTACAGATCAAAGTATCAGTATAGCTTTTCAGGATTTAACAAAACTCATGGAGAAAGCTAAAGAAATGGTCACAATATCTAAGACAATTTCATCGAAAATCAGGGAGAAGCAAGGTGATATATCAGAGGACGACACAGTGAGATTCAAATCCTACTTAATGAGTTTGGGTATAGACGATCCTGTCACTAGAGATGCATTTAGATCAGACTCGGAATATTACATGGGTCTCTCTCATCAAATTGCCGATATGATAGTTGCTGCTTTGGTGGATTGTGGCGGTATCATGTCCTTGGCGGATGTGTGGTGTAGAGTGAACAGAGCTAGAGGTCTAGAACTCATTTCACCCGAGGATTTATTGAATGCTTGCAAATTGTTACAGACTATTGATGCTCCAATGTCCTTACGTAAATTTCCAAGTGGTGCATGTGTGTTACAACTTAACAGTCATAGGGATGAAGAAGTTGCAAAAACAACAAGTGAACTGCTTGAGGCAAGCGGCTTGTTAACCCCGGAGAAGTTATCTCAAATAGCAAACGTTTCCGTCCTACTTGCACGAGAACAACTATTCACAACTGAACGAATGGGCCTAGCTTGCAGAGATGAATCTATCGAAGGATTGGCTTTCTACCCTAATTTGTTTCTAAGCAAAGTGTAA

Protein sequence:

>DPOGS212213-PA
MDRFEYIEARLFEGETYLKRDKNVKIYDGEDKTQFVDGEIILTTHRILWGKPGDIPKGLVCLSLHLYYIFCVEEESGGVFGLGGPKRIILHLGPALPGKRPGPAVVSPFHFIKFSFKDGIDSVFYKALNDAVAAKAWQIETPNNSNLTSPTSVTPKTSTSPINSKIRSGIVGIERSIEEQHKATDQSISIAFQDLTKLMEKAKEMVTISKTISSKIREKQGDISEDDTVRFKSYLMSLGIDDPVTRDAFRSDSEYYMGLSHQIADMIVAALVDCGGIMSLADVWCRVNRARGLELISPEDLLNACKLLQTIDAPMSLRKFPSGACVLQLNSHRDEEVAKTTSELLEASGLLTPEKLSQIANVSVLLAREQLFTTERMGLACRDESIEGLAFYPNLFLSKV-