Monarch geneset OGS2.0

DPOGS206917
TranscriptDPOGS206917-TA933 bp
ProteinDPOGS206917-PA310 aa
Genomic positionDPSCF300001 - 1416236-1419331
RNAseq coverage269x (Rank: top 40%)
Annotation
HeliconiusHMEL0106327e-16182.58% 
BombyxBGIBMGA013016-TA1e-14170.00% 
DrosophilaCG2663-PB1e-10155.99% 
EBI UniRef50UniRef50_E2A8W68e-11258.31%Alpha-tocopherol transfer protein-like n=2 Tax=Endopterygota RepID=E2A8W6_CAMFO
NCBI RefSeqNP_001040355.13e-12967.54%cellular retinaldehyde-binding protein [Bombyx mori]
NCBI nr blastpgi|1140515906e-12867.54%cellular retinaldehyde-binding protein [Bombyx mori]
NCBI nr blastxgi|1140515902e-12467.54%cellular retinaldehyde-binding protein [Bombyx mori]
Group
Gene OntologyGO:00068103e-17transport
GO:00056223e-17intracellular
GO:00052153e-17transporter activity
KEGG pathway 
InterPro domain[96-279] IPR0012513.2e-45Cellular retinaldehyde-binding/triple function, C-terminal
[54-76] IPR0010713e-17Cellular retinaldehyde binding/alpha-tocopherol transport
[14-93] IPR0110742.4e-16Phosphatidylinositol transfer protein-like, N-terminal
Orthology groupMCL26029 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS206917-TA
ATGGCGTCAGCTGCCACTCTAGTCCAGCCCACAGGTGAGATGTGGAAGAAGATTCGGGTGGAACTTAACGAGGATGTTAACACCCGGGATAACGACTTGGCAGCAATCAAAGAATGGCTGCGAAAGCAACCGCATCTTCCCGACTCCTGGGATGATGGCCGCACAATGACATTTCTTAGAGGCTGCAGCTTCTCTTTGGAAAAATGCAAAAGAAAATTAGACATGTACTTTACAATGAGAGCAGCTTGTCCAGAATTTTTCCAGGATAGAGATGTTAGTAGACCGGAGCTAGCAAATTTGGTTACTAAAGTGCAAGGAGCTCCATTACCAGGTTTAACTCCAAACGGAAGACGAGTAACAGTATGCCGTGGTCTTGATAAGAATATAGACGCGGATGAGTTGAACAACGTTTTCAAAGTCGCGCTTATGATAGGAGATGTTCGTCTAAAAGAGGAGTTAGAGGGAGTTGGTGGCGACATATACATCCTAGACGCATCAGTTGTGTCTCCAAGCCATTTAGCAAAGTTGTCTCCGTCGGCGATAAAGAAGTTTTTAATATGTGTTCAGGAGGCGTATCCAGTCAAGTTAAAAGAAGTACATGTCGTGAATACGTCACCCATAATCGAAACCTTAATAAACTTTATTAAGCCATTCCTTAAGGATAAAATTAAAAACAGGATATTTATTCACTCTGACATAAACACAATATACGATCACGTGCCAAGGGATATGCTTCCAGAAGAATATGGCGGCAATGGTGGTTCATTAGACGAAGTAAATAGGGAATGGATGAAAAAGTTAGCTGATTACACCCAGTGGTTCAAAGAACAGGAATCCGTGAAAGCCAACGAAGCGCTAAGACTGGGGAAACCCACCAATTACGATGAGCTCTTCGGTATTGACGGATCATTTAGACAATTGTCTATCGATTAA

Protein sequence:

>DPOGS206917-PA
MASAATLVQPTGEMWKKIRVELNEDVNTRDNDLAAIKEWLRKQPHLPDSWDDGRTMTFLRGCSFSLEKCKRKLDMYFTMRAACPEFFQDRDVSRPELANLVTKVQGAPLPGLTPNGRRVTVCRGLDKNIDADELNNVFKVALMIGDVRLKEELEGVGGDIYILDASVVSPSHLAKLSPSAIKKFLICVQEAYPVKLKEVHVVNTSPIIETLINFIKPFLKDKIKNRIFIHSDINTIYDHVPRDMLPEEYGGNGGSLDEVNREWMKKLADYTQWFKEQESVKANEALRLGKPTNYDELFGIDGSFRQLSID-