Monarch geneset OGS2.0

DPOGS214233
Transcript: DPOGS214233-TA, 1107 bp
Protein: DPOGS214233-PA, 368 aa
Genomic position: DPSCF300014 + 918484-926205
RNAseq coverage: 63x (Rank: top 68%)
Annotation
Heliconius: HMEL012837  1e-97  71.55%
Bombyx: BGIBMGA005955-TA  4e-107  58.96%
Drosophila: CG33966-PA  6e-70  43.32%
EBI UniRef50: UniRef50_Q2MGL8  8e-68  43.32%  CG33966 n=25 Tax=Drosophila RepID=Q2MGL8_DROME
NCBI RefSeq: XP_002093088.1  7e-69  43.32%  GE21129 [Drosophila yakuba]
NCBI nr blastp: gi|195490316  1e-67  43.32%  GE21129 [Drosophila yakuba]
NCBI nr blastx: gi|157816801  9e-66  43.32%  RE21492p [Drosophila melanogaster]
Group:
Gene Ontology:
    GO:0006810  8.2e-13  transport
    GO:0005622  8.2e-13  intracellular
    GO:0005215  8.2e-13  transporter activity
KEGG pathway:
InterPro domain:
    [156-340]  IPR001251  1.4e-42  Cellular retinaldehyde-binding/triple function, C-terminal
    [74-153]  IPR011074  1.3e-14  Phosphatidylinositol transfer protein-like, N-terminal
    [114-136]  IPR001071  8.2e-13  Cellular retinaldehyde binding/alpha-tocopherol transport
Orthology group: MCL30391  Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS214233-TA
ATGGTTCGTGAATTGACCCCCGAGCTGTCTGCGATAGCAAAAAAAGATTTAAATGAGCATCCAAAGCAGCTTGCAAATGATTTAACAAATTTAAAAGAATGGATATCAAAACAACCTCATTTAAAAGCGAGAACTGGTGTTAATTTATCCAGTCCGGAGTCACAACGTCGCAGACGACTGTTATCGTTTTGTGCAATGGTTCGTGAATTGACCCCCGAGCTGTCTGCGATAGCAAAAAAAGATTTAAATGAGCATCCAAAGCAGCTTGCAAATGATTTAACAAATTTAAAAGAATGGATATCAAAACAACCTCATTTAAAAGCGAGAACTGATGACCAATGGCTGGCTGCTTTGCTGAGGGGTTGTAAGTTCAGCTTAGAACGTGCCAAGAGTAAATTGGACTTATTTTATACTTTGCGTTCCACCGCTCCCGATGTCACCTTAAGATTGAAGCCAACGGAACCAGCATTTATAGAATTTCTCAGACTTGGAACATGTCTAATACTACCTCAACCAAAAAATCTGCATCCGACTGTTATAATGATAAGGCCGGGTGCCTTTGATCCAGAAAAATATAATGGTGCTGATATAATGTGTATTTTATACTACTTAGTACAGATATTAGTCATGGAAAACGACGTGGCCGCTGTGATGGGAACCATGATACTGGTTGATTACCAGAATGTTACGATGGGTCATTTAACACAAGCAAATCCTAGCCTTTTGAAGAAATTGGTAGCCGTCAGCCAGGACTCGTTGCCGCTGCGTCTCAAAGGTAGCCATCATGTGAATGTCCCTCCTGGTATAGAAATTATTTTTAAACTTGTATCTGGCTTTTTGGGAGAAAAAGCAAAACAGAGACTGCGAATTTATAAATGCTATGACGAGTTACTTGAAATTTTACCAAAGGACACAGTTCCAGTTGAATATGGAGGAAGTGGCGGCTCAGTAAAAGAAATTATAGAATATTGGGAGAATAAGATCGTAGAGTACAGGCCGTGGTTGGAGGAAGAGATGAAGTACGGCACTGACGAGAGCAAAAGAGTGAATAAAGATAATTTTGACGTCAGTAATCAGGGTTCATTCAGAACGTTAGATATAGATTAA

Protein sequence:

>DPOGS214233-PA
MVRELTPELSAIAKKDLNEHPKQLANDLTNLKEWISKQPHLKARTGVNLSSPESQRRRRLLSFCAMVRELTPELSAIAKKDLNEHPKQLANDLTNLKEWISKQPHLKARTDDQWLAALLRGCKFSLERAKSKLDLFYTLRSTAPDVTLRLKPTEPAFIEFLRLGTCLILPQPKNLHPTVIMIRPGAFDPEKYNGADIMCILYYLVQILVMENDVAAVMGTMILVDYQNVTMGHLTQANPSLLKKLVAVSQDSLPLRLKGSHHVNVPPGIEIIFKLVSGFLGEKAKQRLRIYKCYDELLEILPKDTVPVEYGGSGGSVKEIIEYWENKIVEYRPWLEEEMKYGTDESKRVNKDNFDVSNQGSFRTLDID-
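
The transcript and protein records above can be cross-checked against each other. The sketch below is illustrative only and is not part of the geneset tooling. It assumes the standard genetic code, and it assumes that the trailing "-" in the protein record marks the stop codon. The names `CODON_TABLE` and `translate` are hypothetical helpers introduced here. The two test fragments are the first six and last four codons copied from DPOGS214233-TA.

```python
from itertools import product

# Standard genetic code in the conventional compact T, C, A, G ordering;
# '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence in frame 1 (hypothetical helper).

    Stop codons are written as `stop_symbol` to match the trailing '-'
    used in the protein record (an assumption about that convention).
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    residues = [CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3)]
    return "".join(residues).replace("*", stop_symbol)


# First six codons and last four codons of DPOGS214233-TA.
head = "ATGGTTCGTGAATTGACC"
tail = "GATATAGATTAA"

print(translate(head))  # MVRELT
print(translate(tail))  # DID-

# Length check: 368 residues plus one stop codon should account for 1107 bp.
print(3 * (368 + 1) == 1107)  # True
```

Passing the full nucleotide sequence to `translate` and comparing the result with the protein record would extend the same check to the whole gene model.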