Monarch geneset OGS2.0

DPOGS206918
Transcript: DPOGS206918-TA, 939 bp
Protein: DPOGS206918-PA, 312 aa
Genomic position: DPSCF300001 - 1402106-1408971
RNAseq coverage: 800x (Rank: top 16%)
Annotation
Heliconius:      HMEL010631        3e-166  87.82%
Bombyx:          BGIBMGA013017-TA  6e-142  75.95%
Drosophila:      CG2663-PB         5e-97   54.55%
EBI UniRef50:    UniRef50_E2A8W6   3e-99   54.05%  Alpha-tocopherol transfer protein-like n=2 Tax=Endopterygota RepID=E2A8W6_CAMFO
NCBI RefSeq:     NP_001040355.1    1e-147  78.85%  cellular retinaldehyde-binding protein [Bombyx mori]
NCBI nr blastp:  gi|114051590      3e-146  78.85%  cellular retinaldehyde-binding protein [Bombyx mori]
NCBI nr blastx:  gi|114051590      1e-143  78.85%  cellular retinaldehyde-binding protein [Bombyx mori]
Group
Gene Ontology:   GO:0006810  4.8e-17  transport
                 GO:0005622  4.8e-17  intracellular
                 GO:0005215  4.8e-17  transporter activity
KEGG pathway 
InterPro domain: [97-281]  IPR001251  5.2e-42  Cellular retinaldehyde-binding/triple function, C-terminal
                 [15-94]   IPR011074  1.1e-17  Phosphatidylinositol transfer protein-like, N-terminal
                 [55-77]   IPR001071  4.8e-17  Cellular retinaldehyde binding/alpha-tocopherol transport
Orthology group: MCL26030 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS206918-TA
ATGAAGTTGAAGTCAGAGTTACTGGTGCAACCTTCGGGGGAGTTGTCCAAGAAGATTAGGGAAGAACTGAAAGAAGATCCTAAAACAAGAGATCGCGACCTCGCCGCTATCAAGGACTGGCTCCGGAAACAACCTCATCTTCCTGATGAATGGGAGGACATGCCGCTCCTCACATTCCTCAGGGGAAGTAGTTTCTCATTGGAGAAATGCAAACGTAAATTGGACATGTACTTCACTATGAGAGCCGCTTGCCCAGAGTTCTTCACCAACCGCGACGCCACAAGCCCGGCCTTGAGGGAAGTCCTGAAAACTAAGCTACAAGGACCAGCACTGCCTGGCGTAACGCCGAATGGCAGAAGAGTGACAATTTGCAGAGGTCTTTATCCCAGTTTAGATTCACAGCAAATAACGGACACTCTGAAGCTAGCTCTCATGATCGGAGACATCAGACTAATTGAAGAAGTGGAAGGCGTCGCTGGCGACATTTATATTTTAGATGGAGCTGTTCTTGGTGCAAGTCTGCTAGGAAGATTATCTCCGTCAGTAATCAAGAAGTTCATGATTTGTGTTCAAGAAGCATATCCAGTGAAACTAAAGGAAGTGCATATAATCAACACTTCTCCTACCGTAGAGAGATTTGTGACGTTCGTGAAGCCGTTCCTTAAGGAGAAGATACGGAAGAGAATCTTCATTCATAAAGATATCAAGGACTTATACAAGTACGTCCCACAAGAAATGTTACCGAAAGAGTACGGCGGTCAGTGTGGTACCATGGACGAGTTACAGCAAAACTGGACGGACAAGCTGATAGAGTACCGTGACTGGTTTAAAGCTCAAGACGCCCTGGTCGCTAACGAGAGCTTGAGACCAGGGCGCCCTACTAACTACGATGAATTATTCGGAATCGACGGCTCGTTTAGACAACTGGCCATTGATTAA

Protein sequence:

>DPOGS206918-PA
MKLKSELLVQPSGELSKKIREELKEDPKTRDRDLAAIKDWLRKQPHLPDEWEDMPLLTFLRGSSFSLEKCKRKLDMYFTMRAACPEFFTNRDATSPALREVLKTKLQGPALPGVTPNGRRVTICRGLYPSLDSQQITDTLKLALMIGDIRLIEEVEGVAGDIYILDGAVLGASLLGRLSPSVIKKFMICVQEAYPVKLKEVHIINTSPTVERFVTFVKPFLKEKIRKRIFIHKDIKDLYKYVPQEMLPKEYGGQCGTMDELQQNWTDKLIEYRDWFKAQDALVANESLRPGRPTNYDELFGIDGSFRQLAID-
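
The transcript and protein lengths in this record are consistent with each other: 939 bp is 313 codons, which gives 312 amino acids plus one stop codon (shown as "-" at the end of the protein sequence). The sketch below shows one way to check that a coding sequence translates to the listed protein. It assumes the standard genetic code (NCBI translation table 1) and that the nucleotide sequence is a complete CDS in frame 1. The names CODON_TABLE and translate are illustrative and not part of the database.

```python
# Minimal sketch: translate a coding sequence with the standard genetic code.
# Assumption: NCBI translation table 1, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

# Build the codon -> amino acid lookup (hypothetical helper name).
CODON_TABLE = {}
for i, b1 in enumerate(BASES):
    for j, b2 in enumerate(BASES):
        for k, b3 in enumerate(BASES):
            CODON_TABLE[b1 + b2 + b3] = AMINO_ACIDS[16 * i + 4 * j + k]


def translate(cds, stop_symbol="-"):
    """Translate an in-frame CDS; stop codons are written as stop_symbol,
    matching the trailing "-" used in this record's protein sequence."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    residues = []
    for pos in range(0, len(cds), 3):
        amino_acid = CODON_TABLE[cds[pos:pos + 3]]
        residues.append(stop_symbol if amino_acid == "*" else amino_acid)
    return "".join(residues)


# First four codons and last five codons of DPOGS206918-TA.
print(translate("ATGAAGTTGAAG"))     # MKLK
print(translate("CTGGCCATTGATTAA"))  # LAID-
```

Passing the full DPOGS206918-TA sequence to translate and comparing the result with the DPOGS206918-PA sequence would extend the same check to the whole record.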