Monarch geneset OGS2.0

DPOGS202138
TranscriptDPOGS202138-TA879 bp
ProteinDPOGS202138-PA292 aa
Genomic positionDPSCF300193 - 13034-25566
RNAseq coverage710x (Rank: top 18%)
Annotation
HeliconiusHMEL0146285e-11979.35% 
BombyxBGIBMGA001511-TA2e-9273.21% 
DrosophilaCG3823-PA5e-5838.03% 
EBI UniRef50UniRef50_D6W9C36e-8148.12%Putative uncharacterized protein n=1 Tax=Tribolium castaneum RepID=D6W9C3_TRICA
NCBI RefSeqXP_969460.11e-8148.12%PREDICTED: similar to CG3823 CG3823-PA [Tribolium castaneum]
NCBI nr blastpgi|910770742e-8048.12%PREDICTED: similar to CG3823 CG3823-PA [Tribolium castaneum]
NCBI nr blastxgi|910770742e-7948.12%PREDICTED: similar to CG3823 CG3823-PA [Tribolium castaneum]
Group
Gene OntologyGO:00068102.6e-07transport
GO:00056222.6e-07intracellular
GO:00052152.6e-07transporter activity
KEGG pathway 
InterPro domain[76-260] IPR0012513.7e-44Cellular retinaldehyde-binding/triple function, C-terminal
[159-180] IPR0010712.6e-07Cellular retinaldehyde binding/alpha-tocopherol transport
[12-73] IPR0110741.1e-06Phosphatidylinositol transfer protein-like, N-terminal
Orthology groupMCL18423 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS202138-TA
ATGATGCCGGACCAGGAATTGGAATGGAACACAGCCCTCGTTCCGATAAAGGATTGGTTGGTCAAACAACCTCATCTACCCAACGATATAGGTGATGTGCTCCTTCGTCGATTCTACGTGAGCTGCGATCGTTCTATGGAGCGAGTAAAGAGAACGATTGATCTATTCTTCACAATCCGATCAACAGCACCAGAACTGTTCCTTAAAAGGGACCCCTGGGCGCCGGAAATACGAAGAGTCTTTGAAATAACTGACATGTTGCCGCTACCAAATAAGACGAAGGAGAACTACAAAGTGTTTGTATACCGTCTGAACAATCCCGACTACGATCTGTTCAACTTCATAGACGCTATCAAGGTGTTCTTCATGTTGGCTGACACTCGACTCACAGAGGAAGACGATATTCCTTCAGGAGAAATACCGATATTCGATGCTACGAACGTCACCCTCAAATTCATGGGAAAAATCAATCTGACAGTACTCAGGAAATATATGCTTTACACACAGGAAGCTCTGCCAATAAGACTAAAACAGGTTCATATAATAAACGCTCCGCCTTACATCGGGAAGTTATACGCGATTTGCAAACCGTTCATCAACACTGAGGTCGCTAAGCTGATAAAATTCCACGAGCCGAATTCAAGCACGTTGTACGAAGACGTACCAGTTGAGATATTACCGGACGAACTCGGAGGTTCCGCTGGCACCATCGAGCAGATCAAGAAGTACTGGATCAAGAGAATCGAAGCCAAGAGGGACTGGTTCCTCTCGAATGACAAGGAGTGGTGTGTGGATGAGGCGCTTCGTCCCCGAGAAACCACGAGGGATGAAAGGACTACTGACCTCCCCGGATCATTCCGATCACTAGCCTTCGATTAA

Protein sequence:

>DPOGS202138-PA
MMPDQELEWNTALVPIKDWLVKQPHLPNDIGDVLLRRFYVSCDRSMERVKRTIDLFFTIRSTAPELFLKRDPWAPEIRRVFEITDMLPLPNKTKENYKVFVYRLNNPDYDLFNFIDAIKVFFMLADTRLTEEDDIPSGEIPIFDATNVTLKFMGKINLTVLRKYMLYTQEALPIRLKQVHIINAPPYIGKLYAICKPFINTEVAKLIKFHEPNSSTLYEDVPVEILPDELGGSAGTIEQIKKYWIKRIEAKRDWFLSNDKEWCVDEALRPRETTRDERTTDLPGSFRSLAFD-