Monarch geneset OGS2.0

DPOGS206988
Transcript: DPOGS206988-TA, 927 bp
Protein: DPOGS206988-PA, 308 aa
Genomic position: DPSCF300001 + 565915-568972
RNAseq coverage: 77x (Rank: top 65%)
Annotation
Heliconius: HMEL003365, 1e-64, 41.10%
Bombyx: BGIBMGA010117-TA, 7e-53, 33.97%
Drosophila: CG12926-PA, 6e-27, 30.17%
EBI UniRef50: UniRef50_UPI00017933DC, 4e-27, 29.69%, UPI00017933DC related cluster n=1 Tax=unknown RepID=UPI00017933DC
NCBI RefSeq: XP_001851205.1, 1e-30, 27.22%, CRAL/TRIO domain-containing protein [Culex quinquefasciatus]
NCBI nr blastp: gi|170047389, 2e-29, 27.22%, CRAL/TRIO domain-containing protein [Culex quinquefasciatus]
NCBI nr blastx: gi|91084815, 1e-28, 28.94%, PREDICTED: similar to CRAL/TRIO domain-containing protein [Tribolium castaneum]
Group
Gene Ontology:
  GO:0006810, 1.5e-07, transport
  GO:0005622, 1.5e-07, intracellular
  GO:0005215, 1.5e-07, transporter activity
KEGG pathway:
InterPro domain:
  [96-279] IPR001251, 8.8e-21, Cellular retinaldehyde-binding/triple function, C-terminal
  [177-198] IPR001071, 1.5e-07, Cellular retinaldehyde binding/alpha-tocopherol transport
  [17-97] IPR011074, 2.8e-07, Phosphatidylinositol transfer protein-like, N-terminal
Orthology group:
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS206988-TA
ATGGATTCATTAACGGTTAGTCCAATCTTACAACACAACGCAGATGCCTTGCAATACATCAGAAGGCAATATAATTTGGATCTACCTGAAAGGATTCAAGAGTCCGTTAATATCTTGTCTGATTGGATAAATAAACAAGACCATCTGGTTAAGAAAAATTATGGCAGTGATTACTTAGAAAGGATACTTATAAACTGTAAAGGTTCTGTGGAGAAAGCTAAGATGAAATTGGACAAGTTATGCTCGTTAAGAACGGCCTTGCCGATATATTTCGAACCATTCGATAGAAAACATCCTGCTGTAAACAAAATAGTTGATGGGTTCTTGCCAAAAATGACGCCCGACCATTGTAGAGTGTTCCTCTTAAGAAACAACATTCAAAAATTTGATTTCGTTCTCTTGGACTTCTACCGAGCCCTGATTGCTAAAGTCGAGTACTTACAAACACATGACTATAACAATGGGATCATAGTAATTTTTGATTATAGAGGTTTAAACATATTAGACTTCATGAAAATTTTTAATATAACAGAAATAGAAGACAACACAAACATTATAACGGAAGGTTATGGAATGAGGATTAAAAGTATTCACATTATAACGAGCTCCAATTTGATCGATATGGTTGTTTCTATTATAAAACAGGGTATGAGCGAAAAATTGGGGAAAAGAATTAATGTTCACAAAAGTCTAGATTCAATATACAACTTCGTACCTAAAGACATCATGCCAGAAGATTATGGAGGATCGGAGAAAACCTTGCAACAACTCCATGAAAACTTATTAAATGAGCTTACTTCATACAAATTCAAAAATCACTTAACAGATATGAGGAATGCCTGTACCAATGAAATTGTACGACGTGAGAATATCCAAAATCAATACCTTGGAATTCCAGGATCATTTAGAAAGTTAGCTGTTGATTAA

Protein sequence:

>DPOGS206988-PA
MDSLTVSPILQHNADALQYIRRQYNLDLPERIQESVNILSDWINKQDHLVKKNYGSDYLERILINCKGSVEKAKMKLDKLCSLRTALPIYFEPFDRKHPAVNKIVDGFLPKMTPDHCRVFLLRNNIQKFDFVLLDFYRALIAKVEYLQTHDYNNGIIVIFDYRGLNILDFMKIFNITEIEDNTNIITEGYGMRIKSIHIITSSNLIDMVVSIIKQGMSEKLGKRINVHKSLDSIYNFVPKDIMPEDYGGSEKTLQQLHENLLNELTSYKFKNHLTDMRNACTNEIVRRENIQNQYLGIPGSFRKLAVD-
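
The transcript and protein lengths listed in this record can be cross-checked against each other: 927 bp corresponds to 308 codons plus one stop codon. The Python sketch below illustrates that check and translates the first and last codons of DPOGS206988-TA. It assumes the standard genetic code (NCBI translation table 1) and that the transcript sequence is coding sequence only; the function name `translate` and the constants are illustrative, not part of the database. The stop codon is printed as `*`, whereas the protein record above marks it with `-`.

```python
# Standard genetic code (NCBI translation table 1), with bases ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

# Build the codon -> amino acid lookup from the two strings above.
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a coding sequence codon by codon; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# Lengths as listed in the record.
TRANSCRIPT_BP = 927
PROTEIN_AA = 308

# 308 codons for the protein plus 1 stop codon should account for all 927 bp.
lengths_consistent = TRANSCRIPT_BP == 3 * (PROTEIN_AA + 1)
print(lengths_consistent)  # True

# First 24 nt and last 9 nt of DPOGS206988-TA, copied from the record.
print(translate("ATGGATTCATTAACGGTTAGTCCA"))  # MDSLTVSP
print(translate("GTTGATTAA"))                 # VD*
```

The two translated fragments match the start (`MDSLTVSP`) and end (`VD` followed by the stop marker) of the DPOGS206988-PA sequence.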