Monarch geneset OGS2.0

DPOGS207391
TranscriptDPOGS207391-TA993 bp
ProteinDPOGS207391-PA330 aa
Genomic positionDPSCF300267 + 142420-143412
RNAseq coverage1114x (Rank: top 11%)
Annotation
HeliconiusHMEL0122470.091.84% 
BombyxBGIBMGA008886-TA2e-17386.75% 
DrosophilaCG10237-PB3e-2527.93% 
EBI UniRef50UniRef50_B0W1F62e-8049.66%Cellular retinaldehyde-binding protein n=2 Tax=Culicinae RepID=B0W1F6_CULQU
NCBI RefSeqXP_001842540.13e-8149.66%cellular retinaldehyde-binding protein [Culex quinquefasciatus]
NCBI nr blastpgi|1700293186e-8049.66%cellular retinaldehyde-binding protein [Culex quinquefasciatus]
NCBI nr blastxgi|1700293181e-7750.87%cellular retinaldehyde-binding protein [Culex quinquefasciatus]
Group
Gene OntologyGO:00068109.4e-30transport
GO:00056229.4e-30intracellular
GO:00052159.4e-30transporter activity
KEGG pathway 
InterPro domain[101-276] IPR0012511.4e-43Cellular retinaldehyde-binding/triple function, C-terminal
[62-84] IPR0010719.4e-30Cellular retinaldehyde binding/alpha-tocopherol transport
[1-92] IPR0110744.3e-10Phosphatidylinositol transfer protein-like, N-terminal
Orthology groupMCL13842 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS207391-TA
ATGAACGAGTACGCGGAGTACGACTGGAAGCTGCAGCAGATTCAGAATGAAAGGCTCTGTGACGTGTCCAGGAAGAGACTTACGGATGGGGACAGGCAACGAGCTCTGGATGGCCTCAAGGACGCCGTAGACTCGAGTCTCAACTTGGCCCTTCGCGACAAGAGCCGCTGTCAAGACGATAACTTTCTGCGACGGTTTCTATACGCCCGTAAACACGACGTTCAGCAGAGTTTTGAGTTGTTGGTACGTTACCATCAACATCGTCGGGAGCATCCGGAACTATGGGCGAATGCGGACGGCGGTGTGTTGAGGGCCCTCGCCGATGGACTCCCAGGCGTGCTGGCGCAACGTGACCGGCGCGGTCGTTGCGTGCTCATTATGTTTGCATCCAATTGGACTCCGCATGCATGCCCTCTAATATCTGTGTTCCGTGCCCTGCTTCTCACACTAGAAAGGACTCTTAATGAAGTGCAAAATCAAGCCAACGGATACGTTATCGTTGTTGACTGGACAGAGTTTACGTTTAAACAATCGTGTAGCTTACAAGCGAAAATCCTAAAAATGATGATCGATTGTTTGCAAGACTGCATGCCGGCACGGTTTAAAAGCATACATTTCATTGGACAGCCTTGGTATGTAGAAACAGCATTAGCAGTTATCAAGCCCTACCTAAAAGCGAAAACCCGTGAAAGAATAGTTCTACATGGGAACAATCTATCTACATTACATGACGCATTACCGTTGGATATTTTACCAGCGGAGTTAGGCGGGGAAGGGCCTTCCTATAACTCTGAACATTGGTTACAAGAATTCTGCCGTTGTGAAAACATAGATGCCAAGCCCGTGTCGGTGGCGGTACCCCCCGCGCCCGTCGATAACGATCTTCCTGCGGCGAAACTCAACAAAGCGTATAAACAAAAAGATAATCAGAACTTTTCTTTTCATGGAAACGAAAAAAGTGCAAAGTCTGAGCTGCTCCGTGATAAAGATTGA

Protein sequence:

>DPOGS207391-PA
MNEYAEYDWKLQQIQNERLCDVSRKRLTDGDRQRALDGLKDAVDSSLNLALRDKSRCQDDNFLRRFLYARKHDVQQSFELLVRYHQHRREHPELWANADGGVLRALADGLPGVLAQRDRRGRCVLIMFASNWTPHACPLISVFRALLLTLERTLNEVQNQANGYVIVVDWTEFTFKQSCSLQAKILKMMIDCLQDCMPARFKSIHFIGQPWYVETALAVIKPYLKAKTRERIVLHGNNLSTLHDALPLDILPAELGGEGPSYNSEHWLQEFCRCENIDAKPVSVAVPPAPVDNDLPAAKLNKAYKQKDNQNFSFHGNEKSAKSELLRDKD-