Monarch geneset OGS2.0

DPOGS214230
TranscriptDPOGS214230-TA927 bp
ProteinDPOGS214230-PA308 aa
Genomic positionDPSCF300014 + 899032-902115
RNAseq coverage0x (Rank: top 99%)
Annotation
HeliconiusHMEL0128377e-9965.18% 
BombyxBGIBMGA005954-TA2e-10257.47% 
DrosophilaCG33966-PA8e-8646.10% 
EBI UniRef50UniRef50_Q16ZA04e-9049.84%SEC14, putative n=6 Tax=Endopterygota RepID=Q16ZA0_AEDAE
NCBI RefSeqXP_320369.45e-9350.32%AGAP012165-PA [Anopheles gambiae str. PEST]
NCBI nr blastpgi|1583004509e-9250.32%AGAP012165-PA [Anopheles gambiae str. PEST]
NCBI nr blastxgi|1583004508e-8950.32%AGAP012165-PA [Anopheles gambiae str. PEST]
Group
Gene OntologyGO:00068101.3e-17transport
GO:00056221.3e-17intracellular
GO:00052151.3e-17transporter activity
KEGG pathway 
InterPro domain[91-277] IPR0012511.8e-46Cellular retinaldehyde-binding/triple function, C-terminal
[50-72] IPR0010711.3e-17Cellular retinaldehyde binding/alpha-tocopherol transport
[10-84] IPR0110743e-16Phosphatidylinositol transfer protein-like, N-terminal
Orthology groupMCL18018 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS214230-TA
ATGTCTATACGACCTTTGTGTCCAGAGTTGGCTAAAAAGGCTCGAGAGGAATTAAACGAAGATGCAAAAACTATTGAAAGCGATCTTCGAAGCATTAAAGATTGGTTGTCCAAACAACCACATTTAAGAGCAAGAACAGATGATCAGTGGCTTGTCGCTTTTTTAAGAGGATGCAAGTACAGTCTAGAGCGCACGAAAGAAAAATTAGACCTATATTATTCTATGAGGTCGTTGGCACCAGAACTATTTAGGGTGAAGGCTACTGATTCTGTTTTTGATGAATTAATCAGTTTGGGGACTTACCTGATACTGCCGAAAACCGCTACCCCTGATTCACCGAGGGTTATCATAATTCGAGCTGGTTGTTATGATCCCGCTAAATACAACTTTATTGACATATTCTCTGCTACTGCACACATACAGAAGATTCTCATTTTCGAAGATGACGCAATTGTTGTATCTGGTTTTAAAACAATTATGGACATGGAAGGCATCACTCTCGCACACTTATTGCAAATCACGCCCAGCGTTATGAAGAAGATGGCTGTTCTTTCACAGGACGCCTGGCCGCTACGTATGAAAGGAGCACATTACATTAATACACCGTCATGGTTTGATAATTTTTTTAACATGGTTAAAAATTTGTTAAATGAAAAAAATAGACAGCGTCTTTACGTACATAATAAAAATTTCGAAGAACTATACAAACATATTCCTCAGGAAATATTACCAAATGAATATGGTGGAAATGGTGGTAATATTAAGGAGATTTCAGAATATTGGAAGGCTAAGGTACAAGAGTATAGCTCGTGGTTAGAAGATGATTTAAAATACGGTTCGGACGAATCAAAGCGAGTGGGAAACCCAAGGACGGCTGAGACATTGTTTGGGGTCGAGGGTTCTTTCAGACAACTGGAGTTTGATTAA

Protein sequence:

>DPOGS214230-PA
MSIRPLCPELAKKAREELNEDAKTIESDLRSIKDWLSKQPHLRARTDDQWLVAFLRGCKYSLERTKEKLDLYYSMRSLAPELFRVKATDSVFDELISLGTYLILPKTATPDSPRVIIIRAGCYDPAKYNFIDIFSATAHIQKILIFEDDAIVVSGFKTIMDMEGITLAHLLQITPSVMKKMAVLSQDAWPLRMKGAHYINTPSWFDNFFNMVKNLLNEKNRQRLYVHNKNFEELYKHIPQEILPNEYGGNGGNIKEISEYWKAKVQEYSSWLEDDLKYGSDESKRVGNPRTAETLFGVEGSFRQLEFD-