Monarch geneset OGS2.0

DPOGS215399
Transcript        DPOGS215399-TA  900 bp
Protein           DPOGS215399-PA  299 aa
Genomic position  DPSCF300088 + 271823-275977
RNAseq coverage   16x (Rank: top 81%)
Annotation
Heliconius      HMEL003676        1e-79   48.44%
Bombyx          BGIBMGA012432-TA  9e-109  63.19%
Drosophila      CG10026-PB        2e-42   34.15%
EBI UniRef50    UniRef50_D6WK77   3e-47   36.50%  Putative uncharacterized protein n=1 Tax=Tribolium castaneum RepID=D6WK77_TRICA
NCBI RefSeq     XP_001606243.1    1e-51   37.86%  PREDICTED: similar to CRAL/TRIO domain-containing protein [Nasonia vitripennis]
NCBI nr blastp  gi|345494153      4e-50   38.10%  PREDICTED: alpha-tocopherol transfer protein-like isoform 2 [Nasonia vitripennis]
NCBI nr blastx  gi|345494155      1e-50   37.86%  PREDICTED: alpha-tocopherol transfer protein-like isoform 1 [Nasonia vitripennis]
Group
Gene Ontology  GO:0006810  2.3e-15  transport
               GO:0005622  2.3e-15  intracellular
               GO:0005215  2.3e-15  transporter activity
KEGG pathway 
InterPro domain  [95-282]  IPR001251  3.1e-36  Cellular retinaldehyde-binding/triple function, C-terminal
                 [61-83]   IPR001071  2.3e-15  Cellular retinaldehyde binding/alpha-tocopherol transport
                 [6-91]    IPR011074  1.4e-14  Phosphatidylinositol transfer protein-like, N-terminal
Orthology group  MCL30926  Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS215399-TA
ATGCCGTTCATAGACATAGCGTTCCAGGCTGAAGTCAGCAGGTTTGAGGACCAGGAGTTCGAGGAGTGCGCCAGGAGGAACTGCTCCGAGGACCCAGAGACCAGGCAGACAGCCATTCACAACCTACGGAACCTTATATACCAACGTGGTGAGTGTAATCCGAGAAGACTAGACGACGCGTACCTCCTGCGATTCCTGAGATGCAGGAGATTCATACCAGCTCTGGCACACAAACTGATTGTCCGATACGAAGAGTTCCGTCGCAAGAACTCCTACTTGTATGACTGCAAGGCCTTCGGTCTCCAGAAGGTGAAGGGGGTATACGGAGGCACCCTGCCGGAGAGTCCGCACCACGGCAGGATCACACTCATGAGGTTCGGTCGTTGGGACACGGAAGCCGTCCCCGTGGTGGACGTGGTCCGTTGCGCTCTGCTGATGGACGAGATCGCGGTCATGCAGCCCAAATTACAAATCCTAGGCGTCACCATCATAGTCGACCTCGAGGGTCTCAGTGTGAGACATGTCAGACACCTCACACCCACTATAGCGCATCAGATCGTCAGCCTCATGGGGGTGTCCTTCCCCCTGCTGCTGCACGGCCTCCATATAGTCCGCTACAACTGGATCCTGAACACTTTCTTCTACGTGTTCAAGCAGTTCATACCGACTGCGGCCTGGGAGAGGGTCCATTTCCACGGACACGACATGGCCTCCCTCCACAGACACATACCCCCCGAGTACCTCCCGCCGGAGTACGGGGGCTGCTGCCCTCACGTGGTGGAGGTCGACGAGTGGGTCAGGAAGGTCGACAGGTTCAAGGACGACTTCATGGTCAACGAGCTCAGGGAGCTGGGCTTCACCGTCGACGATCAGCAGTATAATAAATTATATGAAATATGA

Protein sequence:

>DPOGS215399-PA
MPFIDIAFQAEVSRFEDQEFEECARRNCSEDPETRQTAIHNLRNLIYQRGECNPRRLDDAYLLRFLRCRRFIPALAHKLIVRYEEFRRKNSYLYDCKAFGLQKVKGVYGGTLPESPHHGRITLMRFGRWDTEAVPVVDVVRCALLMDEIAVMQPKLQILGVTIIVDLEGLSVRHVRHLTPTIAHQIVSLMGVSFPLLLHGLHIVRYNWILNTFFYVFKQFIPTAAWERVHFHGHDMASLHRHIPPEYLPPEYGGCCPHVVEVDEWVRKVDRFKDDFMVNELRELGFTVDDQQYNKLYEI-
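
The transcript and protein entries above can be cross-checked against each other: a 900 bp coding sequence gives 300 codons, which is 299 residues plus one stop codon. The following is a minimal sketch of that check, assuming the standard genetic code (NCBI translation table 1) applies to this transcript. The function name `translate` and its `stop_symbol` parameter are illustrative and are not part of any geneset tooling. The stop codon is written as `-` to match the protein record.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), with codons
# enumerated in T, C, A, G order at each of the three positions.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence codon by codon (hypothetical helper)."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        amino = CODON_TABLE[cds[i:i + 3]]
        residues.append(stop_symbol if amino == "*" else amino)
    return "".join(residues)


# Length check using the figures from the record: 299 aa plus one stop codon.
transcript_bp = 900
protein_aa = 299
print(transcript_bp == 3 * (protein_aa + 1))   # True

# First seven and last three codons of DPOGS215399-TA.
print(translate("ATGCCGTTCATAGACATAGCG"))      # MPFIDIA
print(translate("GAAATATGA"))                  # EI-
```

Passing the full DPOGS215399-TA sequence to `translate` and comparing the result with DPOGS215399-PA would extend the check to every residue.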