Monarch geneset OGS2.0

DPOGS215398
TranscriptDPOGS215398-TA873 bp
ProteinDPOGS215398-PA290 aa
Genomic positionDPSCF300088 + 264869-267995
RNAseq coverage0x (Rank: top 99%)
Annotation
HeliconiusHMEL0036761e-7749.65% 
BombyxBGIBMGA012431-TA2e-11666.32% 
DrosophilaCG10026-PB2e-3832.64% 
EBI UniRef50UniRef50_E2C3S64e-4735.59%Retinaldehyde-binding protein 1-like protein 1 n=10 Tax=Formicidae RepID=E2C3S6_HARSA
NCBI RefSeqXP_001606243.12e-5037.15%PREDICTED: similar to CRAL/TRIO domain-containing protein [Nasonia vitripennis]
NCBI nr blastpgi|3838645862e-4939.69%PREDICTED: alpha-tocopherol transfer protein-like [Megachile rotundata]
NCBI nr blastxgi|3454941555e-4837.37%PREDICTED: alpha-tocopherol transfer protein-like isoform 1 [Nasonia vitripennis]
Group
Gene OntologyGO:00068101e-09transport
GO:00056221e-09intracellular
GO:00052151e-09transporter activity
KEGG pathway 
InterPro domain[121-281] IPR0012511.1e-31Cellular retinaldehyde-binding/triple function, C-terminal
[20-100] IPR0110744.8e-12Phosphatidylinositol transfer protein-like, N-terminal
[61-83] IPR0010711e-09Cellular retinaldehyde binding/alpha-tocopherol transport
Orthology groupMCL23367 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS215398-TA
ATGCCATTCCAAGAAATAGCCTTCCGCGCGGAACTTGACCGCCATGAGGATCCAGAGTTCGAGTACCAGGCGAGCATACTCTGTGAGGAGGACGCCGCGGCGCGCGCCGACCGCGTACAGCTACTGAGGAATATGATTTATGAGCGTGGGGAGTGTGTCCCACCTCGTATGGACGACGCGTTCTTGTTGAGATTCCTCCGCGCTCGTCGCTCCATACCCGCGCGCGCGCACCGCCTCATGGTTCGTTATTGTAAATTCCGTGAGCAGCATCCTCATCTGTGGAAAAACGTCTATTGGTACAGCTTGTCGAAGCTGGGTGAAACCTTCGAGGGAGTTCTCTTTGACAGACCGGATGTCGGAAGACTTATTATATGCCGTTTAGGTCAATGGAACCCGGACATATACCCCGCTGACGATCTCATTCGCGGTTGTCTCTTGTCCCTGGAGATAGGCATCATGCAACCGAAGCTCCAGGTTCTAGGCGGAACAGCTATAGTAGACTGCGAGGGGATCACCATGAAGCATATGAGACAGTTGTCACCGTCGATTGCTGTTCAAGCTATGAATATCATGGGGTTCTCGTTTCCTCTCCACCAACGCGGTGTTCACATCGTGAACTGCTCGCGGTGGTTCGAGAAACTGTTCCATTTGTTGAAACGTTTCGCTCCTGCTGACGAGTTGTGGAGGAAGGTCTACTTCCATGGATACGAGTATACATCTTTACATAGGTACATCGACCCCGAGTGCCTCCCAAAGAGATACGGCGGCCACCGCGAGTCCGTCTCGGTTAGAGATTGGCTCACAAAAATAAAACAGTATAAGAACAAACAGTTCGACGACGACATCAGCTGTTTGGGATATGCCATAGATTAA

Protein sequence:

>DPOGS215398-PA
MPFQEIAFRAELDRHEDPEFEYQASILCEEDAAARADRVQLLRNMIYERGECVPPRMDDAFLLRFLRARRSIPARAHRLMVRYCKFREQHPHLWKNVYWYSLSKLGETFEGVLFDRPDVGRLIICRLGQWNPDIYPADDLIRGCLLSLEIGIMQPKLQVLGGTAIVDCEGITMKHMRQLSPSIAVQAMNIMGFSFPLHQRGVHIVNCSRWFEKLFHLLKRFAPADELWRKVYFHGYEYTSLHRYIDPECLPKRYGGHRESVSVRDWLTKIKQYKNKQFDDDISCLGYAID-