Monarch geneset OGS2.0

DPOGS206941
TranscriptDPOGS206941-TA2019 bp
ProteinDPOGS206941-PA672 aa
Genomic positionDPSCF300001 - 607596-620707
RNAseq coverage535x (Rank: top 24%)
Annotation
HeliconiusHMEL0033653e-7540.32% 
BombyxBGIBMGA012887-TA2e-4331.92% 
DrosophilaCG2663-PB7e-2626.64% 
EBI UniRef50UniRef50_UPI00022477892e-3233.88%UPI0002247789 related cluster n=1 Tax=unknown RepID=UPI0002247789
NCBI RefSeqXP_973232.18e-3128.77%PREDICTED: similar to CRAL/TRIO domain-containing protein [Tribolium castaneum]
NCBI nr blastpgi|3454940598e-3233.88%PREDICTED: alpha-tocopherol transfer protein-like [Nasonia vitripennis]
NCBI nr blastxgi|3454940598e-3333.88%PREDICTED: alpha-tocopherol transfer protein-like [Nasonia vitripennis]
Group
Gene OntologyGO:00068103.8e-06transport
GO:00056223.8e-06intracellular
GO:00052153.8e-06transporter activity
KEGG pathway 
InterPro domain[100-284] IPR0012511.2e-24Cellular retinaldehyde-binding/triple function, C-terminal
[399-420] IPR0010713.8e-06Cellular retinaldehyde binding/alpha-tocopherol transport
Orthology groupMCL23328 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS206941-TA
ATGGAAGCCGTTCCTTATCATCCACTTCTAGAAGTAACGAATGAAGAATTTAGATCCACCAGAAAATTCTACGATATCGAAAACATTAAAGTTTTAAACGAAAGCCTCGACTCGGTGGAAGAATGGATAAAGAAACAGGATCATTTGTCTGAAGCCGGTCGACATTTGGATCGCGAATCTTTAGAAAGACTTTTTTTACTCTCTCGAGCTTCTGTAGAGGGAACAAAATTAAAAATTGAAAAATTTTTCACAACCAGGGGAATGATGCCCGAGCTGAATCTCAACAAAAGAATCGAGGAATTTGAAAAACTACTGGACTTTTTGATCTATGTGCCATTGCCGAAGTTACATCCAAAAGATAATTCAAGAGTGATGGTGACGCATATCTTATCAGACAAATCGGAAAATTTCTCACTACTATCTTATTTGAGATTTTGTTTTTTTGTTGGAGAATATCGCATTGGTTATGATTACAGTACAGCCGACAGATATGTTATAGATCTGAAAAATTTTAACATGAATTTACTAACCAAATTGAATCCTATTTTATTAAAAAAGGCTGAAATTTTATGCACGGAATGTATCGGTACGAAGATAAAAGGAATTCATCTCTTAAATGCTCCACCATTTGTGGATAAAATAGTTTTCATACTTAAACAAGGTCTTAAGGAAAAAGTGGCCAGCAGGCTCACCGTCCACACCACATATGAAGATTTCCATAAAGAAGTTCCAAAAGAAATACTACCTAAAGATTTTGATGGCGATGGACCCAGCCTATCAAAGCTTGCTGACCAATGGAAGGACATGCTGAAATCAGATGAAGTGAGAAATGTCAGTGAAAAATTTGACAAACTGATTTCGGATGAATCGAAACGATCAGAAATGAAATTCAACGAAGAATACTTAGGAATGCCCGGGTCTTTTAGAAAATTAACTGTTATATATATTTCTGCAATAAATTTTCTGATTTACAGTTACGGATGTTTTCTGCCAAAATTAACAAACGATCACTACAGAGTGTACGCTGTTAAAAATCAAATCAAGAATACAGTTGAAAGTGGATTCTTAGATTATTATCGTTTTTACTTTATGTTGTGTGAATATGTTCAAGCTCACGATTATTGTAATGGTCTGGTGATTTTCGTGGACTACTCCGATGCAAACATCATGGAATCGATAAAATGGATAACTGTCACAGACATGACACGTCTGATGGAAATAATGAGGGAAGGATATGGAATGAGGATAAAAGGTATTCACATTTATACACAATCTATGGCAATAGACGCACTGGTTTCTATTATGAAACAAGGGACCAGTCCTAAGGTTGCCAATAGATACAAAGTGCATAAGACCTTAGAAAGCGTCTATGATTATATACCCAAAGACATCTTGCCAGTTGAATATGGAGGCAAAGAAAAACCCTTGTTTGATCTTCACAAAAAAATGAGAAATATTTTTGCTAACGAATTCAAGGATTACTTAGAAGAAATGAGACAGGCAGGAGTGAACGAAAATCTCAGAACATCGGATTCTGTCAATGGTTCTCAATATTTGGGAATATCGGGCACTTTTAGGCAATTGAATCGTCCAGAGCCTGAATGCAGAGCCTGCGGTGCCTGGTGTATCGGCGTCGCGCTGCGATGCAATCACCGCGCACCGCAGCCAGCTTCCGAGACTACCAATACACTTGACGCTTGCGCTGGCCCAGATGGACTAGAGGCGTCGCCACGTCATAAATCCTGGACATGGCTTCCTGAAGCGATCTTGCCTCCTCGTTCACTTCGGCAGGCTTTTCTGGTGGAATTACCCAAGCCTGCACAATTGCAGCTTCGAGGAACAGCTCTCGGTTTAGCTGCCTCAGGGCCCATCGCGCACCACTGGGGCCTGCCTGACCGGCGGATCTTTGGACTCGGCGGTGACGTCGAACCGGATAAACTGGTGGTCGGAATACGTCTTCACGTCCTCCACGACGCGCCACCGGCTATTAAATCGAAATTTCCCGAAACATGGAGGTGA

Protein sequence:

>DPOGS206941-PA
MEAVPYHPLLEVTNEEFRSTRKFYDIENIKVLNESLDSVEEWIKKQDHLSEAGRHLDRESLERLFLLSRASVEGTKLKIEKFFTTRGMMPELNLNKRIEEFEKLLDFLIYVPLPKLHPKDNSRVMVTHILSDKSENFSLLSYLRFCFFVGEYRIGYDYSTADRYVIDLKNFNMNLLTKLNPILLKKAEILCTECIGTKIKGIHLLNAPPFVDKIVFILKQGLKEKVASRLTVHTTYEDFHKEVPKEILPKDFDGDGPSLSKLADQWKDMLKSDEVRNVSEKFDKLISDESKRSEMKFNEEYLGMPGSFRKLTVIYISAINFLIYSYGCFLPKLTNDHYRVYAVKNQIKNTVESGFLDYYRFYFMLCEYVQAHDYCNGLVIFVDYSDANIMESIKWITVTDMTRLMEIMREGYGMRIKGIHIYTQSMAIDALVSIMKQGTSPKVANRYKVHKTLESVYDYIPKDILPVEYGGKEKPLFDLHKKMRNIFANEFKDYLEEMRQAGVNENLRTSDSVNGSQYLGISGTFRQLNRPEPECRACGAWCIGVALRCNHRAPQPASETTNTLDACAGPDGLEASPRHKSWTWLPEAILPPRSLRQAFLVELPKPAQLQLRGTALGLAASGPIAHHWGLPDRRIFGLGGDVEPDKLVVGIRLHVLHDAPPAIKSKFPETWR-