Monarch geneset OGS2.0

DPOGS201347
TranscriptDPOGS201347-TA798 bp
ProteinDPOGS201347-PA265 aa
Genomic positionDPSCF300176 + 799190-802402
RNAseq coverage974x (Rank: top 13%)
Annotation
HeliconiusHMEL0123868e-14285.66% 
BombyxBGIBMGA003132-TA9e-11466.79% 
DrosophilaCG10026-PB8e-7147.56% 
EBI UniRef50UniRef50_D6WK776e-8153.25%Putative uncharacterized protein n=1 Tax=Tribolium castaneum RepID=D6WK77_TRICA
NCBI RefSeqXP_001606243.11e-8455.20%PREDICTED: similar to CRAL/TRIO domain-containing protein [Nasonia vitripennis]
NCBI nr blastpgi|3454941551e-8355.20%PREDICTED: alpha-tocopherol transfer protein-like isoform 1 [Nasonia vitripennis]
NCBI nr blastxgi|3454941558e-8355.20%PREDICTED: alpha-tocopherol transfer protein-like isoform 1 [Nasonia vitripennis]
Group
Gene OntologyGO:00068101.7e-27transport
GO:00056221.7e-27intracellular
GO:00052151.7e-27transporter activity
KEGG pathway 
InterPro domain[57-237] IPR0012516.1e-45Cellular retinaldehyde-binding/triple function, C-terminal
[17-39] IPR0010711.7e-27Cellular retinaldehyde binding/alpha-tocopherol transport
[1-56] IPR0110747e-12Phosphatidylinositol transfer protein-like, N-terminal
[7-39] IPR0082731.9e-06Cellular retinaldehyde-binding/triple function, N-terminal
Orthology groupMCL12295 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS201347-TA
ATGATATACGAGCGGGGTGAATGCACTCCAAAACGCATGGACGATGAATACCTGCTTAGGTTTTTACGAGCTCGGAATTTTATCCCGCAAAGAGCGCACAGACTGATGGTGAATTATCACCAGTTCAAAGAAGACAATCCAGAGTTATTCGAGAACGTATATCCCCTGGATTTACGGAGGATTGGAGACACCAACGTCATGGCTGTACCGCCATATCGTGATCAAAATGGTCGCAGACTTTTGCTTTATAGAATTGGATCCTGGGACCCCAAATCTGTTGCTGTGGAAGATATGCTGAAGGCGACAATATTCGCCTTGGAGCTCGGATTGTTGGAGCAGCGAACCCAAATTCTGGGTGGAATAGCTTTGTTTGATTTGGAAGATTTAGGTACACAGCATTTGTGGCAGGTCACGCCGTCGGTTGCGAGCAAAATTATTAAACTTTTAGTCTCAAGTTTTCCCGCAACCACTCACGCCATACATATAATTAACCATTCTTGGATATTCGATAAAATGTACAGCATATTCAAACCGCTACTGACAGCGCAAATGCGCTCGAGAATCTTCTTCCACGGATATGACGTCACATCCCTTCACGAGCACATCCAGCCGGACTACCTTCCGGAACGGTATGGTGGTGTATGGCCCGACTATCCGTATACCATCTGGCTCGAATCTTTGAAGAAAAATTACACCGTTGCCAAACAGGTACTGGCGCTCGGGTATAAGTTCCGCGAAGAGGAGGTTTGTCCCGAAGTCGTTAGAAGACTAAAGGAGGAAGGTATAAAATTATCGTGA

Protein sequence:

>DPOGS201347-PA
MIYERGECTPKRMDDEYLLRFLRARNFIPQRAHRLMVNYHQFKEDNPELFENVYPLDLRRIGDTNVMAVPPYRDQNGRRLLLYRIGSWDPKSVAVEDMLKATIFALELGLLEQRTQILGGIALFDLEDLGTQHLWQVTPSVASKIIKLLVSSFPATTHAIHIINHSWIFDKMYSIFKPLLTAQMRSRIFFHGYDVTSLHEHIQPDYLPERYGGVWPDYPYTIWLESLKKNYTVAKQVLALGYKFREEEVCPEVVRRLKEEGIKLS-