Monarch geneset OGS2.0

DPOGS206942
TranscriptDPOGS206942-TA936 bp
ProteinDPOGS206942-PA311 aa
Genomic positionDPSCF300001 - 603417-606990
RNAseq coverage93x (Rank: top 62%)
Annotation
HeliconiusHMEL0033652e-8448.72% 
BombyxBGIBMGA012885-TA1e-5238.46% 
DrosophilaCG2663-PB5e-2828.62% 
EBI UniRef50UniRef50_D4QDB47e-2730.21%CRAL-TRIO domain containing protein n=2 Tax=Ponerini RepID=D4QDB4_9HYME
NCBI RefSeqXP_001851205.11e-3130.98%CRAL/TRIO domain-containing protein [Culex quinquefasciatus]
NCBI nr blastpgi|1700473892e-3030.98%CRAL/TRIO domain-containing protein [Culex quinquefasciatus]
NCBI nr blastxgi|1700473897e-2930.90%CRAL/TRIO domain-containing protein [Culex quinquefasciatus]
Group
Gene OntologyGO:00068104.6e-10transport
GO:00056224.6e-10intracellular
GO:00052154.6e-10transporter activity
KEGG pathway 
InterPro domain[97-280] IPR0012516.9e-25Cellular retinaldehyde-binding/triple function, C-terminal
[179-200] IPR0010714.6e-10Cellular retinaldehyde binding/alpha-tocopherol transport
[17-97] IPR0110741.1e-06Phosphatidylinositol transfer protein-like, N-terminal
Orthology groupMCL23328 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS206942-TA
ATGGAATCTATACCAAAGGACTCTTTATTGGAATTTAATCCAGATACTCTTCAGTTTCTTAGAAAACAGTACAATCTGGATACACCCGGCAGAATTGAAGAAGCAATTGAAATTTTGTCAGAATGGCTCAAAAAACAAAATCATTTCCTAAGAAAAGAATTTCCAAAAGATTATTTAGAAAGAACAATTATAATAAGCAAAGGTTCTGTTGAGAGGGCAAAAAAGAAGCTGGACAGTATATGCACCTATCGGACCCTGTTCCCTGAATATTTTACCGTCTTTAATGTTCAGGAAAACGAGCTCTTGGATGAATTTTATGGGTCTTTTCTACCCAAATTAACAAGCGATCACTACAGGGTATATGCTGTGAGAAACAAAATAAAGAAAACATGTGAAAGTGGATTTTTAGATTTCTACCGATTTTTCCTCATGCAATGCGAATATGTCCAAGCGCATGACTACTGCAATGGTCTTATTATTTTCATTGACTATTCCGACGCAAACATCATGGAGTCGGTAAAATGGTTTAACATAAGTGACGTGAAACGTATCCTGGATATAATGAAGGAGGGATACGGTATGAGGATAAAGGGTATACACTTCTATACAGAATCAAAAGCCGTCGATGCCCTTGTAACAATTATCAAACAAGGAGCTAGCCAAAAGGTGGCCGGCAGAGTCCAAGTACATAAAACATTAGACAAAATTTACGAATACATTCCCAAAGACATTATGCCGTTAGAGTACGGAGGTCAAGAAAAACCATTATTTGAACTGCATAAAAAGATGCGACATATTTGTTCTACGGAGTTTAAGAGTTATTTAGAGGAAATAAGACAAGCGGGAACAAATGAAAGCCTGAGACCAGAAGATTCAGTAAATAAATCGCACTATATGGGAATATCCGGGACATTCAGAAATTTAAGTGTTGATTAA

Protein sequence:

>DPOGS206942-PA
MESIPKDSLLEFNPDTLQFLRKQYNLDTPGRIEEAIEILSEWLKKQNHFLRKEFPKDYLERTIIISKGSVERAKKKLDSICTYRTLFPEYFTVFNVQENELLDEFYGSFLPKLTSDHYRVYAVRNKIKKTCESGFLDFYRFFLMQCEYVQAHDYCNGLIIFIDYSDANIMESVKWFNISDVKRILDIMKEGYGMRIKGIHFYTESKAVDALVTIIKQGASQKVAGRVQVHKTLDKIYEYIPKDIMPLEYGGQEKPLFELHKKMRHICSTEFKSYLEEIRQAGTNESLRPEDSVNKSHYMGISGTFRNLSVD-