Monarch geneset OGS2.0

DPOGS212706
Transcript: DPOGS212706-TA, 1032 bp
Protein: DPOGS212706-PA, 343 aa
Genomic position: DPSCF300012 - 754056-760932
RNAseq coverage: 3382x (Rank: top 4%)
Annotation
Heliconius: HMEL008599, 4e-77, 47.35%
Bombyx: BGIBMGA013159-TA, 1e-86, 50.68%
Drosophila: CG11550-PA, 1e-49, 36.03%
EBI UniRef50: UniRef50_F4WLR1, 6e-60, 43.97%, Alpha-tocopherol transfer protein-like protein n=8 Tax=Formicidae RepID=F4WLR1_ACREC
NCBI RefSeq: XP_396507.2, 8e-56, 41.00%, PREDICTED: similar to CG11550-PA [Apis mellifera]
NCBI nr blastp: gi|332024665, 2e-59, 43.97%, Alpha-tocopherol transfer protein-like protein [Acromyrmex echinatior]
NCBI nr blastx: gi|332024665, 6e-59, 43.97%, Alpha-tocopherol transfer protein-like protein [Acromyrmex echinatior]
Group
Gene Ontology:
  GO:0006810, 2.4e-05, transport
  GO:0005622, 2.4e-05, intracellular
  GO:0005215, 2.4e-05, transporter activity
KEGG pathway:
InterPro domain: [128-310] IPR001251, 3.7e-34, Cellular retinaldehyde-binding/triple function, C-terminal
Orthology group: MCL20958, Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS212706-TA
ATGAACAGCACTTCTATACAACAGTCTACGTATAGAGTTACTTCTATGGCCGACCTTAAACGGACTGCTGTGAAAAAATTGCCCAAACAACCAGGGTCGACGCCGGGTTTCGCACGAACACAAATGACGTCACAAGACGTTCAACCGATATCTTTGGAGGAAGAGTACAGAAAGAAGACAGGGATCAGTCCAGAAGATATACAGAAGTTACGTAGTTGGATGCAGACACAGCCGCATCTGCCAGAGAATTATATTACAGATTTAGATTTAATCCTGGCGTTCCATTCCTGCGACTGTAGTTCGGGCCTAACGAAGCAAGTGCTGGACACACATTACACTTTGAGGACTTTGTTCCCCTGCTTCAAGGACCGCAGAGTAGATCAAGTCATAGAGACGGCCGAAACGGTGCTCCTGATACCTTTGCCGACGCCAGCAAAGCATGGGTACAAAATAAACTACAGTCACGTTCTAAAAACAGATCCAAAATCTTTCAATTTTAGCGAAACAGTTAAGGCGTTGTTTATGATAATAGATGTTTACCAGTACGAAGAAGGAACATGGCCTGGATTTCTATTTGTTGTAGACTTTGAAGGTATCTCCCTAGGTCATTTAGGCAAGATAGACTTGCAAAGTTTACAACATGTACTATATTTTCTTCAGGAGGCTATGCTGGTTAAATTAAAGGGCATGCACTTCATCAACGCACCGTCCTTCATCGATAAACTTTTATTGATGATGAGGCCTTTCTTGAAAAAGGAACTCATGGATATGTTACACATCCATACAACCGGATCTAACAAACTACAAAATTTTATTGACATAGAAGCCCTGCCAAAAGAAGCTGGCGGCTCGTACAAAAGCATTCACGGATGTAAAGATGACGTCATAGCCAAGTTGAAAAAACATGCAGATTTTTTCGAAAAAGAAAAATACAAACGTGTTACGGAATCTCTGAGACCTGGAAGACCTAAAACGATAACAGATATCTTCGGAGGAATCGAAGGCTCCTTCAAGAAGCTGGAAATAGATTGA

Protein sequence:

>DPOGS212706-PA
MNSTSIQQSTYRVTSMADLKRTAVKKLPKQPGSTPGFARTQMTSQDVQPISLEEEYRKKTGISPEDIQKLRSWMQTQPHLPENYITDLDLILAFHSCDCSSGLTKQVLDTHYTLRTLFPCFKDRRVDQVIETAETVLLIPLPTPAKHGYKINYSHVLKTDPKSFNFSETVKALFMIIDVYQYEEGTWPGFLFVVDFEGISLGHLGKIDLQSLQHVLYFLQEAMLVKLKGMHFINAPSFIDKLLLMMRPFLKKELMDMLHIHTTGSNKLQNFIDIEALPKEAGGSYKSIHGCKDDVIAKLKKHADFFEKEKYKRVTESLRPGRPKTITDIFGGIEGSFKKLEID-
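
The transcript and protein lengths listed above (1032 bp and 343 aa) can be cross-checked against each other: a complete coding sequence should contain one codon per residue plus one stop codon. The following is a minimal Python sketch of that check. The function names `lengths_consistent` and `check_cds` are hypothetical helpers introduced here for illustration and are not part of the database; the sketch assumes the standard stop codons and treats a trailing `-` or `*` in the protein as a stop marker.

```python
# Standard stop codons (assumption: standard genetic code).
STOP_CODONS = {"TAA", "TAG", "TGA"}


def lengths_consistent(nt_len: int, aa_len: int) -> bool:
    """True if a CDS of nt_len bases encodes aa_len residues plus one stop codon."""
    return nt_len == 3 * (aa_len + 1)


def check_cds(nt_seq: str, protein: str) -> bool:
    """Rough sanity check that nt_seq looks like a complete CDS for protein.

    Checks length, start codon, and stop codon only; it does not translate
    the sequence codon by codon.
    """
    nt_seq = nt_seq.upper()
    residues = protein.rstrip("-*")  # drop a trailing stop marker, if present
    return (
        len(nt_seq) % 3 == 0
        and nt_seq.startswith("ATG")
        and nt_seq[-3:] in STOP_CODONS
        and len(nt_seq) // 3 == len(residues) + 1
    )


# Lengths taken from the record: 1032 bp transcript, 343 aa protein.
print(lengths_consistent(1032, 343))
```

To check the full record, `check_cds` could be called with the two FASTA sequences above passed in as strings.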