Monarch geneset OGS2.0

DPOGS214235
TranscriptDPOGS214235-TA927 bp
ProteinDPOGS214235-PA308 aa
Genomic positionDPSCF300014 + 931541-932966
RNAseq coverage14x (Rank: top 82%)
Annotation
HeliconiusHMEL0128375e-6150.43% 
BombyxBGIBMGA005958-TA1e-6750.18% 
DrosophilaCG33966-PA5e-6339.41% 
EBI UniRef50UniRef50_Q16ZA05e-7242.67%SEC14, putative n=6 Tax=Endopterygota RepID=Q16ZA0_AEDAE
NCBI RefSeqXP_001659081.13e-7442.67%hypothetical protein AaeL_AAEL008275 [Aedes aegypti]
NCBI nr blastpgi|1571183695e-7342.67%hypothetical protein AaeL_AAEL008275 [Aedes aegypti]
NCBI nr blastxgi|1571183693e-7142.67%hypothetical protein AaeL_AAEL008275 [Aedes aegypti]
Group
Gene OntologyGO:00068101.1e-12transport
GO:00056221.1e-12intracellular
GO:00052151.1e-12transporter activity
KEGG pathway 
InterPro domain[92-278] IPR0012519e-46Cellular retinaldehyde-binding/triple function, C-terminal
[9-89] IPR0110745.7e-15Phosphatidylinositol transfer protein-like, N-terminal
[50-72] IPR0010711.1e-12Cellular retinaldehyde binding/alpha-tocopherol transport
Orthology groupMCL20934 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS214235-TA
ATGTTTCGACCTTTGCGCCCGGCCCTACAAGAGAAAGCAATCAGAGAGGTATTTGAAAAGCCTAACAGAATAACGTCTGATATCAAAACCCTGAGGGAATGGCTCGAAAAGCAGCCGCATTTACAAGCAGTTAATCCTTCTGATCAATGGCTCTTGTCATTTTTAAGAGGAAATAAATTCAGTCTGGAAAAAACTAAAGAGAAATTGGACATGTATTATGCATTAAAAAATATCGTGCCAGAAATTTTCAAGTATCGAGATCCGTTCGATCCAAAAATTCAAGAAATCTTAAAACTCGGACCGTATCTACCTCTCACATCACTTACTTCGGATGATGGACAGCGATTTTGTGTTACCCGCTTTGGATTACATGATCCAAATAAGATCCACATATTCGATATCTTAAAGGTGATAATTATGATAATTGAAATACTCATGTTTGAAGATGACAATTTTATTATTTCTGGAGAGAGTTTATTTATTGATCTGAAAGATGTTAGTCTAATTACTTTTAGTCAATGGACTCCCAACGTTGCAAAAAAAATTTTAGTATGTGTGGAGAAAGCTCTTCCAGTTCGAATGAAAAGTTGCCATTTGTTGAATATTCCTCCAGGATTTGACACGGCATTTGCTATATTTAGGGCTTTTGTCAGTGAGAAACTTAAAAATCGAGTTCATGTGTACAAGAAAAATTATGAAGAAATCTTTAATACAATTCCCAAAAACATTTTACCAAAAGAATTGGATGGAGAAACTGGCACATTACAAGAAATTGCCGACTACTGGAAAGCAAAGGTTGAAAGTTATAGAGACTGGTTTCTCAACGAAGAAGAATGTTCGGATGAGTCTTTAAGACCAGGAAAACCTAGATCTTCTTCAAGCATTTTTGGTGTAGAGGGTTCTTTTAGACAACTAGATGTAGACTAG

Protein sequence:

>DPOGS214235-PA
MFRPLRPALQEKAIREVFEKPNRITSDIKTLREWLEKQPHLQAVNPSDQWLLSFLRGNKFSLEKTKEKLDMYYALKNIVPEIFKYRDPFDPKIQEILKLGPYLPLTSLTSDDGQRFCVTRFGLHDPNKIHIFDILKVIIMIIEILMFEDDNFIISGESLFIDLKDVSLITFSQWTPNVAKKILVCVEKALPVRMKSCHLLNIPPGFDTAFAIFRAFVSEKLKNRVHVYKKNYEEIFNTIPKNILPKELDGETGTLQEIADYWKAKVESYRDWFLNEEECSDESLRPGKPRSSSSIFGVEGSFRQLDVD-