Monarch geneset OGS2.0

DPOGS206989
Transcript: DPOGS206989-TA, 942 bp
Protein: DPOGS206989-PA, 313 aa
Genomic position: DPSCF300001 + 576261-581834
RNAseq coverage: 36x (Rank: top 74%)
Annotation
Heliconius: HMEL011062  2e-98  52.40%
Bombyx: BGIBMGA010117-TA  6e-68  40.89%
Drosophila: CG2663-PB  6e-33  28.39%
EBI UniRef50: UniRef50_D4QDB4  3e-31  31.71%  CRAL-TRIO domain containing protein n=2 Tax=Ponerini RepID=D4QDB4_9HYME
NCBI RefSeq: XP_973232.1  6e-36  30.00%  PREDICTED: similar to CRAL/TRIO domain-containing protein [Tribolium castaneum]
NCBI nr blastp: gi|91084815  1e-34  30.00%  PREDICTED: similar to CRAL/TRIO domain-containing protein [Tribolium castaneum]
NCBI nr blastx: gi|91084815  6e-34  30.00%  PREDICTED: similar to CRAL/TRIO domain-containing protein [Tribolium castaneum]
Group
Gene Ontology:
GO:0006810  1.3e-07  transport
GO:0005622  1.3e-07  intracellular
GO:0005215  1.3e-07  transporter activity
KEGG pathway 
InterPro domain:
[99-283]  IPR001251  1.3e-25  Cellular retinaldehyde-binding/triple function, C-terminal
[181-202]  IPR001071  1.3e-07  Cellular retinaldehyde binding/alpha-tocopherol transport
[17-97]  IPR011074  8.1e-07  Phosphatidylinositol transfer protein-like, N-terminal
Orthology group: MCL18022 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS206989-TA
ATGGAGTCAATACCAAGAGGCAGAATCCTGCAAATAAGACCAGACACTTTGGAACAAGTTCGTAAAGAGTATAACCTGGAGAAAAAGGGTAGAATGGAGGAAGCGGTAAAAATGTTAGATGAGTGGATCCAAAAACAGCCGCATTTCAAGAAGAAGGATTTCGATCCATATTTCTTAGAAACTACAATATTAGCAAGCAAAGGATCCCTTGAACGTGCTAAAAGACAAATAGATAAAAATTGCACTCTGAGAACACTTTTACCGGTTTATTTCGGTAATTTTAACGTCAGAAATGATTTTAAAAATATACATGATGTCGTACAGACAGCAGTCTTACCAAAGTTGACGCCTGATCACTACAGAATAGTCCTTACGAAGTTCAATAATATCCCTTTCGATAGTTCAGATGTGATTAATTTTTACAGATATAACGTTATTTTGGGAGATTATTTAAAATCACATGATTATCTGAGGGGCTTCATAGTAATATCGGATTACTCAGACGCAAATATGATGGATTATGTGTCAAAAATTAATCCTATTGATTTAAGAAATGCTTTCAGTATTTATTTAGAAGGGTATGGTATGAGAATAAAAGGCATTCACATCGTATGCTCTTCAAAATTTGTGGACGCGTTTATAACAATACTGAAACAAGTTTTAAGTGAAAAAGTAGCAAACAGAATCTCCGTTCATAAGTCAATAGATGATTTATCTAAAGTCATTCCAAAGGAAATATTGCCTGTGGACTATGGTGGTGATGAAAAGTCGATTAAGACTCTATCTGATGAATGGATTGACGTCCTCTCAACAAAGGAGCATATGGATTATGTAGCCGACATGAATCAAGCTGGTACGGATGAATCACTCAGACAGCCAGATAAATTCTGTGACACATGTGCTGGAATGCCGGGTAGTTTTAGAATATTGAGTGTAGATTGA

Protein sequence:

>DPOGS206989-PA
MESIPRGRILQIRPDTLEQVRKEYNLEKKGRMEEAVKMLDEWIQKQPHFKKKDFDPYFLETTILASKGSLERAKRQIDKNCTLRTLLPVYFGNFNVRNDFKNIHDVVQTAVLPKLTPDHYRIVLTKFNNIPFDSSDVINFYRYNVILGDYLKSHDYLRGFIVISDYSDANMMDYVSKINPIDLRNAFSIYLEGYGMRIKGIHIVCSSKFVDAFITILKQVLSEKVANRISVHKSIDDLSKVIPKEILPVDYGGDEKSIKTLSDEWIDVLSTKEHMDYVADMNQAGTDESLRQPDKFCDTCAGMPGSFRILSVD-
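
The transcript and protein records above can be checked against each other: 942 bp is 314 codons, that is, 313 residues plus one stop codon, which the protein record appears to show as a trailing "-". The following is a minimal Python sketch of that check, assuming the standard genetic code applies to this nuclear gene. The names `translate` and `lengths_consistent` are hypothetical helpers written for this illustration and are not part of any MonarchBase tooling.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (first base slowest).
# Assumption: the standard table is appropriate for this transcript.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence codon by codon.

    Stop codons are written as `stop_symbol`; "-" is used here to match
    the trailing character of the protein record above.
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


def lengths_consistent(transcript_bp, protein_aa):
    """True if the transcript is exactly the protein plus one stop codon."""
    return transcript_bp == 3 * (protein_aa + 1)


# First 24 and last 12 nucleotides of DPOGS206989-TA, copied from the record.
print(translate("ATGGAGTCAATACCAAGAGGCAGA"))  # MESIPRGR
print(translate("AGTGTAGATTGA"))              # SVD-
print(lengths_consistent(942, 313))           # True
```

Passing the full nucleotide sequence to `translate` and comparing the result with the DPOGS206989-PA record would extend the same check to the whole coding sequence.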