Monarch geneset OGS2.0

DPOGS214231
TranscriptDPOGS214231-TA933 bp
ProteinDPOGS214231-PA310 aa
Genomic positionDPSCF300014 + 902996-905511
RNAseq coverage80x (Rank: top 64%)
Annotation
HeliconiusHMEL0128377e-10169.04% 
BombyxBGIBMGA005954-TA2e-8955.09% 
DrosophilaCG33966-PA4e-7544.79% 
EBI UniRef50UniRef50_Q16ZA04e-7948.25%SEC14, putative n=6 Tax=Endopterygota RepID=Q16ZA0_AEDAE
NCBI RefSeqXP_320369.44e-8348.60%AGAP012165-PA [Anopheles gambiae str. PEST]
NCBI nr blastpgi|1583004507e-8248.60%AGAP012165-PA [Anopheles gambiae str. PEST]
NCBI nr blastxgi|1583004505e-8448.60%AGAP012165-PA [Anopheles gambiae str. PEST]
Group
Gene OntologyGO:00068104.2e-15transport
GO:00056224.2e-15intracellular
GO:00052154.2e-15transporter activity
KEGG pathway 
InterPro domain[91-277] IPR0012515.3e-43Cellular retinaldehyde-binding/triple function, C-terminal
[10-84] IPR0110743e-16Phosphatidylinositol transfer protein-like, N-terminal
[50-72] IPR0010714.2e-15Cellular retinaldehyde binding/alpha-tocopherol transport
Orthology groupMCL18018 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS214231-TA
ATGTCTATACGACCTTTGTGTCCAGAGTTGGCTAAAAAGGCTCGAGAGGAATTAAACGAAGATGCAAAAACTATTGAAAGCGATCTTCGAAGCATTAAAGATTGGTTGTCCAAACAACCACATTTAAGAGCAAGAACAGATGATCAGTGGCTAGTGGCGTTTTTAAGGGGATGTAAGTACAGCCTAGAGCGCACGAAAGAAAAGTTGGACCTTTATTACTCAATGAGATCTTTGGCACCAGAACTATTTCGACTTAAGGCTTCGGATCCACTTTTCAATGAAATCATGGATTTAGGGACTTATGTAACTCTCTTGAAAACCGCAACACCCGACTCACCGAGGATTATAATTATTCGGGCTGGTAGTTATGATCCCGCTAAGTATAACTTCCTCGACATATTCTCTGCAGCATCAATTATACAGCGCATCCTTATATACGAAGATGACGCGACTATTATATCAGGTTTTAAAACAATTATGGACATGGAAGGGGTCACCCTTGCACATTGGTTGCAAATAACACCGAGTTCTATGAAGAAGATGGCTGTACTTTCTCAGGACGCGGGACCAGTGCGTATGAAAGGCACACATTATATCAATACACCTCCAGGATTTGAAAATATTTTTGGCATTATTAAAAATGTGCTCAACGAGAAAAACAGAAATAGGCTTTACGTCCATAACAAAAATTATGAAGAACTATACAAACATATTCCTCAGGAAATATTACCAAATGAATATGGTGGAAATGGTGGTAATATTAAGGAAATTTCAGAATATTGGAAGGCCAAGGTACAAGAGTATAGCTCATGGTTAGAAGATGATTTAAAATACGGTTCGGACGAATCAAAGCGAGTGGAAAACCAAGGACGGCTGAGATATTGTTTGGGGTCGAGGGTTCATTCAGACAATTGGAATTTGATTAACTTATAA

Protein sequence:

>DPOGS214231-PA
MSIRPLCPELAKKAREELNEDAKTIESDLRSIKDWLSKQPHLRARTDDQWLVAFLRGCKYSLERTKEKLDLYYSMRSLAPELFRLKASDPLFNEIMDLGTYVTLLKTATPDSPRIIIIRAGSYDPAKYNFLDIFSAASIIQRILIYEDDATIISGFKTIMDMEGVTLAHWLQITPSSMKKMAVLSQDAGPVRMKGTHYINTPPGFENIFGIIKNVLNEKNRNRLYVHNKNYEELYKHIPQEILPNEYGGNGGNIKEISEYWKAKVQEYSSWLEDDLKYGSDESKRVENQGRLRYCLGSRVHSDNWNLINL-