Monarch geneset OGS2.0

DPOGS201346
TranscriptDPOGS201346-TA933 bp
ProteinDPOGS201346-PA310 aa
Genomic positionDPSCF300176 + 753153-760285
RNAseq coverage170x (Rank: top 51%)
Annotation
HeliconiusHMEL0123892e-14475.57% 
BombyxBGIBMGA003032-TA2e-7347.39% 
DrosophilaCG5973-PC3e-8046.43% 
EBI UniRef50UniRef50_Q9VM114e-7846.43%CG5973, isoform A n=15 Tax=Diptera RepID=Q9VM11_DROME
NCBI RefSeqXP_310039.44e-8448.69%AGAP009364-PA [Anopheles gambiae str. PEST]
NCBI nr blastpgi|1582881757e-8348.69%AGAP009364-PA [Anopheles gambiae str. PEST]
NCBI nr blastxgi|1582881752e-8049.50%AGAP009364-PA [Anopheles gambiae str. PEST]
Group
Gene OntologyGO:00068109.6e-18transport
GO:00056229.6e-18intracellular
GO:00052159.6e-18transporter activity
KEGG pathway 
InterPro domain[113-298] IPR0012513.9e-50Cellular retinaldehyde-binding/triple function, C-terminal
[73-95] IPR0010719.6e-18Cellular retinaldehyde binding/alpha-tocopherol transport
[27-114] IPR0110745.6e-16Phosphatidylinositol transfer protein-like, N-terminal
Orthology groupMCL18812 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS201346-TA
ATGGGATTACAATTCGAAACGGATGGTACACCGTACGTTATGTGTGGTGATGAAAAGGTCAAGTTGGAAAAATTTCCCATCACAGAGGATTTCTATGTCGAAAAAGCCAGAAGTGAATTACGTGAAACGGAGGAAAACATTGTACAGGCCCTGAAAGAATTAAGGGAACTGTTGCAACAGGAATCAAATCTTCTGGTGCCTTTCGACAACGATGAGTTTCTAATGAAATTTCTGCGTCCATCAAAATTATATGCGGAGAGCGCATTTAAACGGATACAAGCATATTACAAGTTCAGACTCTCTCACAAGGACTACTGCTGTGATTTGTATCCGAGTAAAGTGCGCTCGGCCTTCGACCATTCCATAGTGTCCATCCTTTCGCCTCGAGACCAACATGGCCGCCGCATCATATATGTGGAATCTGGCGAACGTTGGAATCCTCGTGAGGTACCTTTAAAGGAAGTTTTTCGAGGCATACAGCTGGGTCTGGAGGGAGCGATGGTTGAACCACGTACGCAAGTCTGTGGTGTGGCGGTCGTGCTAAACATGAAGGGCTTGTCATTTTCACAGATAATGCAATTCACGCCCTCATTTGCCAAAATGGTTGTTGATTGGATTCAGGATTGCATTCCGATTCGCCTAAAAGGAGTACACGTGATAAATCAGCCATATATATTCAGTATGTTGTTCGCTATATTCAAACCGTTTCTACGCGAAAAGTTAAGATCTAGAATATTCTTCCACGGCTCCGATAAAGAGTCTTTATTGAAACATATACAACCTGGTTCCTTGATGAGAAGGGTTGGTGGAGACTTACCTGATGATGACATAACTGGTGAAGTTTTGTGGAAGATGCTCAATCATTACGAAGATGAATTCAGACACGCAAATTCCTATGGTTACGTCACTAATAACAACGAAATCAAGAAATAA

Protein sequence:

>DPOGS201346-PA
MGLQFETDGTPYVMCGDEKVKLEKFPITEDFYVEKARSELRETEENIVQALKELRELLQQESNLLVPFDNDEFLMKFLRPSKLYAESAFKRIQAYYKFRLSHKDYCCDLYPSKVRSAFDHSIVSILSPRDQHGRRIIYVESGERWNPREVPLKEVFRGIQLGLEGAMVEPRTQVCGVAVVLNMKGLSFSQIMQFTPSFAKMVVDWIQDCIPIRLKGVHVINQPYIFSMLFAIFKPFLREKLRSRIFFHGSDKESLLKHIQPGSLMRRVGGDLPDDDITGEVLWKMLNHYEDEFRHANSYGYVTNNNEIKK-