Monarch geneset OGS2.0

DPOGS207039
Transcript: DPOGS207039-TA; 924 bp
Protein: DPOGS207039-PA; 307 aa
Genomic position: DPSCF300001 + 1766132-1773568
RNAseq coverage: 750x (Rank: top 17%)
Annotation
Heliconius: HMEL006853; 2e-116; 76.76%
Bombyx: BGIBMGA012979-TA; 4e-46; 67.36%
Drosophila: CG2663-PB; 5e-41; 31.43%
EBI UniRef50: UniRef50_Q8I099; 7e-39; 31.43%; CG2663, isoform B n=24 Tax=Neoptera RepID=Q8I099_DROME
NCBI RefSeq: XP_001607936.1; 9e-40; 29.77%; PREDICTED: similar to ENSANGP00000012173 [Nasonia vitripennis]
NCBI nr blastp: gi|307206098; 1e-38; 30.94%; Alpha-tocopherol transfer protein-like [Harpegnathos saltator]
NCBI nr blastx: gi|156545860; 5e-38; 29.77%; PREDICTED: clavesin-2-like [Nasonia vitripennis]
Group
Gene Ontology: GO:0006810; 1.8e-08; transport
Gene Ontology: GO:0005622; 1.8e-08; intracellular
Gene Ontology: GO:0005215; 1.8e-08; transporter activity
KEGG pathway:
InterPro domain: [95-280] IPR001251; 7.2e-30; Cellular retinaldehyde-binding/triple function, C-terminal
InterPro domain: [11-92] IPR011074; 2.8e-09; Phosphatidylinositol transfer protein-like, N-terminal
InterPro domain: [53-75] IPR001071; 1.8e-08; Cellular retinaldehyde binding/alpha-tocopherol transport
Orthology group: MCL26027; Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS207039-TA
ATGGCGTTTCTCGAAGGTCCTTCTCCAAAGCAGGAGGAATTTATTAAACAGGAACTCGGTGAAGGTCCTGGGGATTTAGAAAGAGGTTTGAGGGACCTACACAGCATGTGCGCAGCCAACCCTTATCTGCCCCAACCCGAAGCCTTAGACACGAATCTATTGAAAACATTCCTACGGGGCTGTCGCATGGACTTTGAAAGAGCGCGAAAAAAGTTAGAAACTTTTTGCTACTCGCGCTCAAGATACAGAGACCTTTTCGAGCATCGTTCACTGAATGAACCACCATTGAATGACGTTTGCAAATTCTTAGATATAGTGCCTCTTCCAAAACTTACGGACGAGGGTTTGAGAGTCACAATATTCAGGGTTCGTCCAAACTACCCTGAAAGTTCTTCTGATATACTGGCTGCTGTAAGAGCCGTATTGCTCATATGTGATGTGAGATTAAGAGACGAGACGCTAATAGCTGGAGACGTATTCATTTGGGAGGCTACCCACGTGCGCGCGTCTATTGCTGCCCGCGTGGCCGCCGCAGCAGGAGCGGTCCGCCGCAGTATACAGCTAGCGCAGGCCGCGTACCCGCAGAGAATGCGCCGCATTCACGTCGTCGGCGCGCCCGCACTCGTCGCCTCCTCACTCAACCTCATGAGAGCATGTGTCAACGAGAAAGTTAGGAAACGATATTATCTACACGACAAGACAGAGGAGCTTCTAGAGCACATCCCAGCAAGAGTATTGCCAGTTGAATGGGGAGGCGAAGAGGAATCGATTGAGGTGTTAAATAGGAAATGGAGACGCCGCGTCGATGAAATGCGCGACTATCTTAGAGACCTCAGCGAGCTGTGTGACGTCACCCCGGACACGCTCTACGACAACGACATCTATGGCGCAGTAGGATCGTTCAGGAAACTGGACATTGATTAG

Protein sequence:

>DPOGS207039-PA
MAFLEGPSPKQEEFIKQELGEGPGDLERGLRDLHSMCAANPYLPQPEALDTNLLKTFLRGCRMDFERARKKLETFCYSRSRYRDLFEHRSLNEPPLNDVCKFLDIVPLPKLTDEGLRVTIFRVRPNYPESSSDILAAVRAVLLICDVRLRDETLIAGDVFIWEATHVRASIAARVAAAAGAVRRSIQLAQAAYPQRMRRIHVVGAPALVASSLNLMRACVNEKVRKRYYLHDKTEELLEHIPARVLPVEWGGEEESIEVLNRKWRRRVDEMRDYLRDLSELCDVTPDTLYDNDIYGAVGSFRKLDID-
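
The transcript and protein records above can be cross-checked against each other. The following is a minimal Python sketch, not part of the original record, that translates a coding sequence with the standard genetic code and checks that the listed lengths agree. It assumes the transcript holds only the coding sequence (no UTRs) and that the trailing "-" in the protein record marks the stop codon. The names `translate` and `lengths_consistent` are hypothetical helpers introduced for illustration.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1).
# Codons are enumerated with bases in the order T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str, stop_symbol: str = "-") -> str:
    """Translate a coding sequence codon by codon.

    Stop codons are written as `stop_symbol`; "-" is an assumption chosen
    to match the way the protein record above ends.
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


def lengths_consistent(transcript_bp: int, protein_aa: int) -> bool:
    """True if the transcript is exactly the protein's codons plus one stop codon."""
    return transcript_bp == 3 * (protein_aa + 1)


# First 30 and last 15 nucleotides of DPOGS207039-TA, copied from the record above.
first_codons = "ATGGCGTTTCTCGAAGGTCCTTCTCCAAAG"
last_codons = "CTGGACATTGATTAG"

print(translate(first_codons))       # MAFLEGPSPK
print(translate(last_codons))        # LDID-
print(lengths_consistent(924, 307))  # True
```

Passing the full DPOGS207039-TA sequence to `translate` and comparing the result with the DPOGS207039-PA record would extend the same check to every residue.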