Monarch geneset OGS2.0

DPOGS200022
TranscriptDPOGS200022-TA882 bp
ProteinDPOGS200022-PA293 aa
Genomic positionDPSCF300337 - 205723-212351
RNAseq coverage952x (Rank: top 13%)
Annotation
HeliconiusHMEL0036765e-13675.09% 
BombyxBGIBMGA012441-TA6e-13775.09% 
DrosophilaCG10026-PB5e-4633.90% 
EBI UniRef50UniRef50_D6WK773e-5338.65%Putative uncharacterized protein n=1 Tax=Tribolium castaneum RepID=D6WK77_TRICA
NCBI RefSeqXP_001606243.16e-6041.79%PREDICTED: similar to CRAL/TRIO domain-containing protein [Nasonia vitripennis]
NCBI nr blastpgi|3454941559e-5941.79%PREDICTED: alpha-tocopherol transfer protein-like isoform 1 [Nasonia vitripennis]
NCBI nr blastxgi|3454941557e-5841.37%PREDICTED: alpha-tocopherol transfer protein-like isoform 1 [Nasonia vitripennis]
Group
Gene OntologyGO:00068104.1e-15transport
GO:00056224.1e-15intracellular
GO:00052154.1e-15transporter activity
KEGG pathway 
InterPro domain[100-283] IPR0012511.4e-38Cellular retinaldehyde-binding/triple function, C-terminal
[22-102] IPR0110741.5e-19Phosphatidylinositol transfer protein-like, N-terminal
[63-85] IPR0010714.1e-15Cellular retinaldehyde binding/alpha-tocopherol transport
[50-85] IPR0082736.8e-10Cellular retinaldehyde-binding/triple function, N-terminal
Orthology groupMCL25968 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS200022-TA
ATGTATGAATGCTTCCTTGAGATAGCGTTCGAGGCTGAACTGAATACAAAAGAAGACCCTGAGCTGCTTGAACTAGCACCCGAACTATGCAACGAGGACGCGTCCACCAGGGCCACGGCCGTCACAAATCTCAGGAATATGATTTTCGAAAGAGCAGAATGCAAACCTCATCGGACAGATGACGCGTTCCTCCTGAGGTTTCTCCGAGCGAGAGACTTCATAGTACCGAGAGCGCATAAACTCCTAGTCCGCTATTACACGTTCCGGGCGGAATATCCCCATTTGTATAAAGACGTCGATCTATGGGGCTTGATGAAGGTCAAAAGTGCTTACGAGGGCTCTATGGTGGACCGACCCGATATAGGAAGACTCTCTATCTTCAGATTTGGAACATGGGATCCTAACGAGTTTCCAGTAGACGATCTGATTCGAACAGGGATGGCCATCACTGAAATAGGTATCCGTCAGCCCAAGTTACAGATCATGGGCGGAACAGTTATCGTTGATTTGGAAGGAATAACACTCCGACACGCCGCAACCCTGACTCCGACCATAGCGTATCAGATAGTCTGTTTAATGGGGCTGGTAACACCGACCCGTATTAAAAGCGTTCACATCATAAACTACTCCTGGGTTCTCAACACATTCTTCTACCTCTTCAAGAAGTTCATACCTCAAGAGGCCTGGAGTAAGATTATCTTCCACGGCAGTGACTTGAAATCCCTTCAGCAACAAATCGACCCAGAATGCCTTCCAGCCAGATATGGCGGTACGTGTAGAAATCACGTCCCGATCGGCGTCTGGTTGCAGAAAATCAAAAAATATCGTGACGAACAATTCGATAGAGAAATGAAAGCGCTCGGATACGTCGTCAAAGAGTGA

Protein sequence:

>DPOGS200022-PA
MYECFLEIAFEAELNTKEDPELLELAPELCNEDASTRATAVTNLRNMIFERAECKPHRTDDAFLLRFLRARDFIVPRAHKLLVRYYTFRAEYPHLYKDVDLWGLMKVKSAYEGSMVDRPDIGRLSIFRFGTWDPNEFPVDDLIRTGMAITEIGIRQPKLQIMGGTVIVDLEGITLRHAATLTPTIAYQIVCLMGLVTPTRIKSVHIINYSWVLNTFFYLFKKFIPQEAWSKIIFHGSDLKSLQQQIDPECLPARYGGTCRNHVPIGVWLQKIKKYRDEQFDREMKALGYVVKE-