Monarch geneset OGS2.0

DPOGS201345
TranscriptDPOGS201345-TA1101 bp
ProteinDPOGS201345-PA366 aa
Genomic positionDPSCF300176 + 738116-747266
RNAseq coverage767x (Rank: top 17%)
Annotation
HeliconiusHMEL0123903e-11861.43% 
BombyxBGIBMGA003129-TA2e-10683.10% 
DrosophilaCG5958-PA1e-6953.77% 
EBI UniRef50UniRef50_Q9VM122e-6753.77%CG5958 n=40 Tax=Neoptera RepID=Q9VM12_DROME
NCBI RefSeqXP_001606183.12e-8467.44%PREDICTED: similar to CRALBP [Nasonia vitripennis]
NCBI nr blastpgi|1565548095e-8367.44%PREDICTED: clavesin-1-like [Nasonia vitripennis]
NCBI nr blastxgi|1565548092e-8167.44%PREDICTED: clavesin-1-like [Nasonia vitripennis]
Group
Gene OntologyGO:00068101.7e-12transport
GO:00056221.7e-12intracellular
GO:00052151.7e-12transporter activity
KEGG pathway 
InterPro domain[174-359] IPR0012517.6e-52Cellular retinaldehyde-binding/triple function, C-terminal
[181-205] IPR0010711.7e-12Cellular retinaldehyde binding/alpha-tocopherol transport
[5-77] IPR0110745.7e-11Phosphatidylinositol transfer protein-like, N-terminal
Orthology groupMCL12859 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS201345-TA
ATGAGCAGCCCTAGCGATTTTAGAATTGAAAGAAATGTCGAGCTATCAGAGGAGACGAAGGAGATCGCTGAACGTGAACTCCGGGAGACTCCTGAGCGTGTTCGGGAAGCTTTGGAAAGACTGAGAGAACTGTTGAAAGAGAACAAAGACATTTATTTCGGAGATGAAGACGAGATATTGACAATATTTCTCCGACCTTGCAAATGGTACCCCGAGAGCGCTCTGGCTTTGGGGACATTTCCTGCTGTCAGATATAACCATGTCCAAAGGTCAGTTAGGATGGGAACCATTGGACATGCAGTTATCATTGTCATAACAAAGGAACCTTTGGACGAAGCTTCTAACGACAACCAGTCAGACATAAACGGCGTGATTTTGGATCCTGATCCAGATATAATACCAACGTACACAAATTGGATGTTAATAAAACAAGTAATAGCGTTACGAGATCCAATGCGTCGGGCTGCTGATTTTAAGCGCGACAACGCGAGCTTGTTGGACGGTTTACTGCCAGAACACGAGAAGGAGGCTTTCCTGGAGCACAAAGTAGTAAACGTCATGAAGGGTCGTGATGATAAAGGAAGAAGAGTGCTCATTGTCAACGTTGGAGGTAGCTGGAACCCCAAAAAGGTGACGGCGGATCAACTGTTCAGGTTATTCTATTTAATTCACGAAGCTGCCATGTTGGAACCGGAGTCCCAAGTCCGAGGAACAGTCGTCATTATGGACTTCCACAAAATGGGCATGAGTCAGACGATGGGTCTAACGCCGGCGTTTTCTAAACGTCTGCTCACTTTCATCCAGGACGCGTTGCCTCTTAGATTGAAGGAGGTGCACTTCGTTAAAGAGCCCATGATCTTCAACATGGTTTGGAAGCTGTTCAAGCCTTTGATCAGGGAAAAGTTGAAGGGCAGGATCTTCTTCCACGGCAGCAACATGTCATCTCTACATAAGCATTTGGCTCCGAGTCATTTGCCCGCGGACTACGACGGTGTCCTGGGCCCCATAGACTACTCGGGCGCCGACTGGTACCCCGTCGTCAACGAGGTGCTCCCGCACATACAGAACTGGAACTCGTACGGTTATGTCAAGAAGGAATGA

Protein sequence:

>DPOGS201345-PA
MSSPSDFRIERNVELSEETKEIAERELRETPERVREALERLRELLKENKDIYFGDEDEILTIFLRPCKWYPESALALGTFPAVRYNHVQRSVRMGTIGHAVIIVITKEPLDEASNDNQSDINGVILDPDPDIIPTYTNWMLIKQVIALRDPMRRAADFKRDNASLLDGLLPEHEKEAFLEHKVVNVMKGRDDKGRRVLIVNVGGSWNPKKVTADQLFRLFYLIHEAAMLEPESQVRGTVVIMDFHKMGMSQTMGLTPAFSKRLLTFIQDALPLRLKEVHFVKEPMIFNMVWKLFKPLIREKLKGRIFFHGSNMSSLHKHLAPSHLPADYDGVLGPIDYSGADWYPVVNEVLPHIQNWNSYGYVKKE-