Monarch geneset OGS2.0

DPOGS203414
TranscriptDPOGS203414-TA1011 bp
ProteinDPOGS203414-PA336 aa
Genomic positionDPSCF300003 + 1585276-1587690
RNAseq coverage330x (Rank: top 35%)
Annotation
HeliconiusHMEL0063874e-15077.25% 
BombyxBGIBMGA012351-TA1e-17081.60% 
DrosophilaCralbp-PA5e-6440.71% 
EBI UniRef50UniRef50_E2ASK52e-12262.58%Alpha-tocopherol transfer protein-like n=17 Tax=Neoptera RepID=E2ASK5_CAMFO
NCBI RefSeqXP_392138.34e-12664.35%PREDICTED: similar to Cellular retinaldehyde binding protein CG10546-PA [Apis mellifera]
NCBI nr blastpgi|3407155077e-12664.65%PREDICTED: alpha-tocopherol transfer protein-like [Bombus terrestris]
NCBI nr blastxgi|3504228443e-12064.65%PREDICTED: alpha-tocopherol transfer protein-like [Bombus impatiens]
Group
Gene OntologyGO:00068102.4e-11transport
GO:00056222.4e-11intracellular
GO:00052152.4e-11transporter activity
KEGG pathway 
InterPro domain[107-287] IPR0012514.1e-34Cellular retinaldehyde-binding/triple function, C-terminal
[10-96] IPR0110749.6e-21Phosphatidylinositol transfer protein-like, N-terminal
[65-87] IPR0010712.4e-11Cellular retinaldehyde binding/alpha-tocopherol transport
Orthology groupMCL16894 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS203414-TA
ATGTCGAGAAGGCTTACGAAAAGGTACTTGACCCCGGCGGATGCTTACAAATGCCCACTGTCAGCAGAAACACAAGCGATAGCCGAAAAGGAGCTGAGAGAAACGGAAAATTCAAGGTCTCAAGCCTTAGAGGCTTTAAGGATGTGGTTGGAACAGAATCCAAAATTCTTATCCATACGATTAGACGCCAATTTCCTTCTACGTTTTCTCCGTACAAAGAAATTCAGCGTTCCGATGGCTCAAGAGGCTATCGAACGTTACGTACTCCTTAGACAGTCATGGGGGATAGCTTTTAATCAATTGGATCACACCTTGCCTGTCATTTCTGAGATCATCGAGTTAGGATATATTTTTGCAAGCCCTTTCAAAGACAAACTTGGACGACGGGTTGTGATATACCGACCGGGCGTTTTCGATCCATACAAATTCACCAACCAAGACATGTGTCGAGTAATGGGTATTTGCTATGAGACACTCATGGAAGACGAGGAGACCCAGGTGCGAGGTTTGGTGCATTACGCAGACGGCGGAGGTGTCAGTTTTCCACACCTTACACTATTCACTCCAAGAGAAGCTGTGAGGATAGTAAAAAATGGGGAGCGCACGATTCCACTAAGGCACAAGGAAATCTATGGTGCTAATGTTCATCCGACGATCAAATTTGCTTTAGACTTTGGGATGGCGCTCATATCTGAGAAGATCAGAAAGCGCGTCAAGTTATACACGTCCATAGAGGACGTAGAAATAGATAAACAACTTTTGCCTCAGGAGTATGGCGGTACAATGCCAATGAAAATAATGATTGAAAAATGGAAAGTGGAAATGGCAAGTAAGAGGGAAACGTTATTGATGAACGACAAGATGGCTGTCCGTCTAGAAATGTATAGCGAGGCGGCGCGAGAGGGCGCTGTGTCCGCATTGAGAGCTGGTGGCACTTGTGCTGGTGCAGATTCTGTTGGAGACGCGATGAGAGGACTCACCGGAAACTTCAGGAAACTTGAAGTCGATTAA

Protein sequence:

>DPOGS203414-PA
MSRRLTKRYLTPADAYKCPLSAETQAIAEKELRETENSRSQALEALRMWLEQNPKFLSIRLDANFLLRFLRTKKFSVPMAQEAIERYVLLRQSWGIAFNQLDHTLPVISEIIELGYIFASPFKDKLGRRVVIYRPGVFDPYKFTNQDMCRVMGICYETLMEDEETQVRGLVHYADGGGVSFPHLTLFTPREAVRIVKNGERTIPLRHKEIYGANVHPTIKFALDFGMALISEKIRKRVKLYTSIEDVEIDKQLLPQEYGGTMPMKIMIEKWKVEMASKRETLLMNDKMAVRLEMYSEAAREGAVSALRAGGTCAGADSVGDAMRGLTGNFRKLEVD-