Monarch geneset OGS2.0

DPOGS214229
TranscriptDPOGS214229-TA1809 bp
ProteinDPOGS214229-PA602 aa
Genomic positionDPSCF300014 + 891767-898147
RNAseq coverage40x (Rank: top 73%)
Annotation
HeliconiusHMEL0128371e-17953.85% 
BombyxBGIBMGA005954-TA1e-9755.67% 
DrosophilaCG33966-PA2e-8145.87% 
EBI UniRef50UniRef50_E9H5D55e-11335.45%Putative uncharacterized protein n=8 Tax=Daphnia pulex RepID=E9H5D5_DAPPU
NCBI RefSeqXP_320369.41e-9050.83%AGAP012165-PA [Anopheles gambiae str. PEST]
NCBI nr blastpgi|1583004502e-8950.83%AGAP012165-PA [Anopheles gambiae str. PEST]
NCBI nr blastxgi|1583004501e-8650.83%AGAP012165-PA [Anopheles gambiae str. PEST]
Group
Gene OntologyGO:00068102.8e-16transport
GO:00056222.8e-16intracellular
GO:00052152.8e-16transporter activity
KEGG pathway 
InterPro domain[385-571] IPR0012512.7e-46Cellular retinaldehyde-binding/triple function, C-terminal
[304-378] IPR0110741.7e-16Phosphatidylinositol transfer protein-like, N-terminal
[344-366] IPR0010712.8e-16Cellular retinaldehyde binding/alpha-tocopherol transport
Orthology groupMCL30392 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS214229-TA
ATGTCTCTACGACAATTGTGTCCCGAATTAGCCGAAAAGGCTAAGTTGGAATTAAATGAAGATCCAAAAACTATTGAGGGCGACATACAGCATATAAAGGACTGGTTGGCTAAGCAACCACACCTGAAAGTAAGAACAGATGACCAATGGTTGCTCGCTTTTATTAGAGGATGTAAGCACAGTCTCGAAAGGACTAAAGAGAAGTTGGATCTGTTTTACACATTACGAACAGTAGCACCGGAGATTTACAAAGTGAAACATAATGAACCTCTATTCAATACAATCATGGATCTGGGGAGTTACTTGATATTGCCAAAGTTGGAAAAGCCGGATTCACCTCGGATTGCTCTTATTCGGCCTGGAATGTACAATCCAGATAAATTTTCTTTTTTCGATATATTCTCTTGTGGTGCTGTATTTCAAAATATTCTGATGTACGAAGATGATGCCATAGTTATATCTGGGCTTACAACCCTTATAGACTTAGAGAGTGTAACAATGGGTCATTTGTTGCAACTCACACCGAGTGTTATGAAAAAGATGGTCGTTTACACCCAGGATGCTCTTCCAATCCGCATGAAAGGCGTTCACTATATTAACACTCCTCCAGGCTTCGAAACGGTATTCAACGCAATTAAGTCGTTGCTTAATGAGAAGAATAGAAACAGGTTGTATGTACACAACAAAAATTATAATGAATTATACAAACACATCTCCCAGGAGGTTTTACCAGCGGAATATGGAGGAAAAGGTGGCAGCATACAGGAAATTAAGGGATATTGGAAGAATAAAATAGACGCATGCAGTTCATATTTGGAAGAAGATCTTAAGAATGGAACTGATGAATCAAAACGTCCCGGAAAACCAAACACTTCTGAAAACCTATTCGGTCTAGAAGGATCTTTCCAGTTAGCCAAAAAGGCACAGGAGGAGTTAAATGAAGATCCGAAAAATATTCAACGTGACCTACAATATATTAAGGATTGGTTATCCAAGCAACCTCATTTAAAGGCTAGACTAGATGATCAGTGGCTTGTCGCTTTTTTAAGAGGATGCAAGTACAGTCTAGAGCGCACGAAAGAAAAAATAGACCTATATTATTCTATGAGGTCGTTGGCACCAGAACTATTTAGGGTGAAGGCTACTGATTCTGTTTTTGATGAATTAATCAGTTTGGGGACTTACCTGATACTGCCGAAAACCGCTACCCCTGATTCACCGAGGATTATCATAATTCGAGCTGGTTGTTATGATCCCGCTAAATACAACTTTATTGACATATTCTCTGCTACTGCACACATACAGAAGATTCTCATTTTCGAAGATGACGCAATTGTTGTATCTGGTTTTAAAACAATTATGGACATGGAAGGCATCACTCTCGCACACTTATTGCAAATCACGCCCAGCGTTATGAAGAAGATGGCTGTTCTTTCACAGGACGCCTGGCCGCTACGTATGAAAGGAGCACATTACATTAATACACCGTCATGGTTTGATAATTTTTTTAACATGGTTAAAAATTTGTTAAATGAAAAAAATAGACAGCGTCTTTACGTACATAATAAAAATTTCGAAGAACTATACAAACATATTCCTCAGGAAATATTACCAAATGAATATGGTGGAAATGGTGGTAATATTAAGGAGATTTCAGAATATTGGAAGGCTAAGGTACAAGAGTATAGCTCGTGGTTAGAAGATGATTTAAAATACGGTTCGGACGAATCAAAGCGAGTGGGAAACCCAAGGACGGCTGAGACATTGTTTGGGGTCGAGGGTTCTTTCAGACAACTGGAGTTTGATTAA

Protein sequence:

>DPOGS214229-PA
MSLRQLCPELAEKAKLELNEDPKTIEGDIQHIKDWLAKQPHLKVRTDDQWLLAFIRGCKHSLERTKEKLDLFYTLRTVAPEIYKVKHNEPLFNTIMDLGSYLILPKLEKPDSPRIALIRPGMYNPDKFSFFDIFSCGAVFQNILMYEDDAIVISGLTTLIDLESVTMGHLLQLTPSVMKKMVVYTQDALPIRMKGVHYINTPPGFETVFNAIKSLLNEKNRNRLYVHNKNYNELYKHISQEVLPAEYGGKGGSIQEIKGYWKNKIDACSSYLEEDLKNGTDESKRPGKPNTSENLFGLEGSFQLAKKAQEELNEDPKNIQRDLQYIKDWLSKQPHLKARLDDQWLVAFLRGCKYSLERTKEKIDLYYSMRSLAPELFRVKATDSVFDELISLGTYLILPKTATPDSPRIIIIRAGCYDPAKYNFIDIFSATAHIQKILIFEDDAIVVSGFKTIMDMEGITLAHLLQITPSVMKKMAVLSQDAWPLRMKGAHYINTPSWFDNFFNMVKNLLNEKNRQRLYVHNKNFEELYKHIPQEILPNEYGGNGGNIKEISEYWKAKVQEYSSWLEDDLKYGSDESKRVGNPRTAETLFGVEGSFRQLEFD-