Monarch geneset OGS2.0

DPOGS214234
TranscriptDPOGS214234-TA933 bp
ProteinDPOGS214234-PA310 aa
Genomic positionDPSCF300014 + 926949-929377
RNAseq coverage93x (Rank: top 62%)
Annotation
HeliconiusHMEL0128378e-6451.74% 
BombyxBGIBMGA005958-TA2e-6546.21% 
DrosophilaCG12926-PA6e-6239.61% 
EBI UniRef50UniRef50_Q7Q2Q58e-6943.23%AGAP004762-PA n=4 Tax=Diptera RepID=Q7Q2Q5_ANOGA
NCBI RefSeqXP_001659081.14e-7240.97%hypothetical protein AaeL_AAEL008275 [Aedes aegypti]
NCBI nr blastpgi|1571183697e-7140.97%hypothetical protein AaeL_AAEL008275 [Aedes aegypti]
NCBI nr blastxgi|1571183697e-6940.97%hypothetical protein AaeL_AAEL008275 [Aedes aegypti]
Group
Gene OntologyGO:00068103e-15transport
GO:00056223e-15intracellular
GO:00052153e-15transporter activity
KEGG pathway 
InterPro domain[93-279] IPR0012512e-41Cellular retinaldehyde-binding/triple function, C-terminal
[51-73] IPR0010713e-15Cellular retinaldehyde binding/alpha-tocopherol transport
[10-90] IPR0110742.2e-14Phosphatidylinositol transfer protein-like, N-terminal
Orthology groupMCL20934 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS214234-TA
ATGAAGATCCGACCACTTTGTCCGCTTCTGCAAGAAAAAGCCGAGAAGGAACTTTTTGAGAAATCAAATAGAATATCTTCGGATGTCGAGGCTATTAAGGAATGGTTAAAGAAACAGCCGCATTTGCAAGCAGTTAACCCAACTGATCAATGGTTGATAGCATTTTTAAGAGGGAACAAGTACAGTTTAGAAAGGACTAAAGAGAAATGTGAAATGTACTACACTCTGCGTACCGTAGTACCAGAAATATTCAAGGGTAGAGACCCAATGGACCCTTATATACAGGACATATTAAATTTAGGATTTTTTCTACCGACAAAATCATGTAAAAGCACAGATGCATGCAAGGCAACTATAACCAGGTTCGGAGCCTCTGACTCTTCAAAATATCATTTGCTTGATATTATGAAAGTAATGTTTATGATCATCGAAATACTTCTACTAGAGGATGATAACTTCGTGGTAGCTGGAATGGATGTGCTGTTTGATATGAAGGGGGTTGGGATAAACATTCTAAGTCAGTGGACTCCGACCATCGCTAAGAAATTAATATTCTTAATAGAGAAGGCTCTACCAGTGAGAATGAAGAGTAGTCACGTAATATATATACCTCCAGGTTTTGAAGCGGCTTATAATTTGTTTAAAGCTTTTGTAGCAGACAAAATTAAACAACGGTTTCATCTGTACGGCCAAAATCATGATGGAATGTACGACGCTCTTCCTCGAAGTATTCTTCCAAAGGAATATGGTGGAGACGATGGGTCTTTGCAAGAACTAATTGACTTCTGGAAAAGGAAAGTCGAAAGTTACCGTGATTGGTTCCTTAAAGAAGAAACTGAACGTTCAAATGAAGCTCTGAGACCTGATAAATCCAAAACTACTTCGAATCTTTTTGGAGTGGAAGGATCTTTTAGACAGTTAGATATTGATTAA

Protein sequence:

>DPOGS214234-PA
MKIRPLCPLLQEKAEKELFEKSNRISSDVEAIKEWLKKQPHLQAVNPTDQWLIAFLRGNKYSLERTKEKCEMYYTLRTVVPEIFKGRDPMDPYIQDILNLGFFLPTKSCKSTDACKATITRFGASDSSKYHLLDIMKVMFMIIEILLLEDDNFVVAGMDVLFDMKGVGINILSQWTPTIAKKLIFLIEKALPVRMKSSHVIYIPPGFEAAYNLFKAFVADKIKQRFHLYGQNHDGMYDALPRSILPKEYGGDDGSLQELIDFWKRKVESYRDWFLKEETERSNEALRPDKSKTTSNLFGVEGSFRQLDID-