Monarch geneset OGS2.0

DPOGS206155
TranscriptDPOGS206155-TA1008 bp
ProteinDPOGS206155-PA335 aa
Genomic positionDPSCF300028 + 1620455-1625266
RNAseq coverage503x (Rank: top 25%)
Annotation
HeliconiusHMEL0061166e-7870.53% 
BombyxBGIBMGA000666-TA1e-7162.50% 
DrosophilaCralbp-PA8e-8549.35% 
EBI UniRef50UniRef50_Q9VRP81e-8249.35%Cellular retinaldehyde binding protein n=17 Tax=Endopterygota RepID=Q9VRP8_DROME
NCBI RefSeqXP_001850304.13e-8749.00%cellular retinaldehyde binding protein [Culex quinquefasciatus]
NCBI nr blastpgi|1700454137e-8649.00%cellular retinaldehyde binding protein [Culex quinquefasciatus]
NCBI nr blastxgi|1700454133e-8248.41%cellular retinaldehyde binding protein [Culex quinquefasciatus]
Group
Gene OntologyGO:00068101.4e-11transport
GO:00056221.4e-11intracellular
GO:00052151.4e-11transporter activity
KEGG pathway 
InterPro domain[117-299] IPR0012511.4e-39Cellular retinaldehyde-binding/triple function, C-terminal
[20-105] IPR0110745.8e-18Phosphatidylinositol transfer protein-like, N-terminal
[75-97] IPR0010711.4e-11Cellular retinaldehyde binding/alpha-tocopherol transport
Orthology groupMCL18388 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS206155-TA
ATGACGACGGCGAGCGTTCTGTTAACCGTTGGGACAACTTGGAGTTCTAACACGGACATATGTAACAAAGATGAAAAACTCGGTCGTACAATCAGGAACGCGCCAGTGGTTGCGGCGAGAGAGTTGCGTGAGACTCCATCAGGAAGAGAACAGGCCTTGAGAATAATGAGGGAATGGATGCAACAGAACTGTGACATCAAAAACGTTAGACAAGATGATTCATTTTTGTTACGTTTCCTTCGTCACAAGAAATTCAGCGTGCCGATGGCTCAGCAGACCCTACTGAAATATTTGAGCCTTAGGAAATACTATCCAAGTATCTTTAAGAACATGGATTGTGAAAATCCCAAAATAAAAGATATAATTAACAGCGGGTACATAGCTGTGTCACCTGTCCGTGACAGCAATGGACGGAGGGTCATCATTTACAACATGGGTAAATTCGATCCCATTAAGTACAGCTGCTGGGACATGTGTCGAGCTCATGTGGTGGTTTATGAGAGTTTGCTTGAGGATCCAAACGATCAGGTCTTTGGTTTCACTCACGTTGGTGATGGGAGCGGGTCTACCACATCCCACGTCACTGCCTGGAACCCTATAGATTTTGCGAGGCTACTAAAGTGGGGCGAGCAATCGTTGCCAATGCGCCACAAGGAATTCCAGTTGGTAAATGTTCCCGCTGCATTAAAATACATTATAGACTTCGCAACCAGCAAAGTATCCCCTAAAATGAGCGAGAGATTGATTATACATACAAATATGAAGAATTTGCACAACAAAGTCGACGTATCCTGCCTACCGACATCCTTCGGTGGTCACATACCGTTACAAGATATGATTCGTTTTACAAAAGATTTGCTTAATGAAAGGCGACAAACTGTTCTGGCTCTCGATGATATGGAAATTCTAAGCACAAGGGGAATCATTTCGTCCAGAAAGCCTACCAACGCGCTGAAACCCGAATCAATTTCCGTTGAAGGAAGCTTTAGGAAATTAGAGATTGATTAA

Protein sequence:

>DPOGS206155-PA
MTTASVLLTVGTTWSSNTDICNKDEKLGRTIRNAPVVAARELRETPSGREQALRIMREWMQQNCDIKNVRQDDSFLLRFLRHKKFSVPMAQQTLLKYLSLRKYYPSIFKNMDCENPKIKDIINSGYIAVSPVRDSNGRRVIIYNMGKFDPIKYSCWDMCRAHVVVYESLLEDPNDQVFGFTHVGDGSGSTTSHVTAWNPIDFARLLKWGEQSLPMRHKEFQLVNVPAALKYIIDFATSKVSPKMSERLIIHTNMKNLHNKVDVSCLPTSFGGHIPLQDMIRFTKDLLNERRQTVLALDDMEILSTRGIISSRKPTNALKPESISVEGSFRKLEID-