Monarch geneset OGS2.0

DPOGS213497
Transcript        DPOGS213497-TA   1032 bp
Protein           DPOGS213497-PA   343 aa
Genomic position  DPSCF300100   +   425130-427150
RNAseq coverage   116x (Rank: top 58%)
Annotation
Heliconius        HMEL016829         1e-171   84.55%
Bombyx            BGIBMGA004378-TA   0.0      87.25%
Drosophila        CG10657-PA         4e-102   56.25%
EBI UniRef50      UniRef50_Q9VTY1    6e-100   56.25%   CG10657 n=8 Tax=Neoptera RepID=Q9VTY1_DROME
NCBI RefSeq       XP_002007550.1     4e-104   59.54%   GI12324 [Drosophila mojavensis]
NCBI nr blastp    gi|383865138       2e-103   59.27%   PREDICTED: clavesin-2-like [Megachile rotundata]
NCBI nr blastx    gi|307176414       7e-102   58.71%   Retinaldehyde-binding protein 1-like protein 1 [Camponotus floridanus]
Group
Gene Ontology     GO:0006810   7.1e-18   transport
                  GO:0005622   7.1e-18   intracellular
                  GO:0005215   7.1e-18   transporter activity
KEGG pathway 
InterPro domain   [131-313]   IPR001251   2.5e-47   Cellular retinaldehyde-binding/triple function, C-terminal
                  [48-128]    IPR011074   3.4e-19   Phosphatidylinositol transfer protein-like, N-terminal
                  [89-111]    IPR001071   7.1e-18   Cellular retinaldehyde binding/alpha-tocopherol transport
                  [64-111]    IPR008273   5.9e-07   Cellular retinaldehyde-binding/triple function, N-terminal
Orthology group   MCL16440   Insect specific

Nucleotide sequence:

>DPOGS213497-TA
ATGGCGACATCAGTCCTGCAGCCTGAAGAATGCCGACGGGATCACGCCCCAAAACCGGACATCGATCCAAGGATAACTGAAGACGAAAGACTGGCAATACTGGAGGCGAAGGAGAGGAATACAGGAGACCTTCCCCCCGAGCTGAGAGAAGCTGCCAGAATCGACATTCGAGAGGAAACAGCTTTAAAAGAACACGCACTCACACAAATGAGACACTTCATTGAGAAACATCCCGCTATTAAGAAATGCAGAACCGACGCACCATTTCTACTTCGTTTCCTGCGCACCAAGAAATATTCTATACCACAAGCCTGCTCAATGCTGGAACGATATCTCACTATAAGACAAATGTACCCGCAGTGGTTCCAAAAGTTGGATCCGTTAGACCCAAAAATAGCCGCAGTGATCGAAGCCGGGTACCTGCTACCCCTACCTAAAAGAGATGCCGAGGGACGTAGAGTTGTGTTGTCTTGCATGGGTCGTTTCGATCCTTATAAGTTTGACAACTGCGTAATGGCGCGTGTACACTCTATGATCGTGGAGCTGTTATTGGACGAGCCGCGATCACAACTACTAGGTTACACACACGTCAACGACGAAGCTGGCATGCAGATGCCGCACGTTAGTCTATGGTCGTTGAACGATGTCCGAATCATGCTTAACTGTATACAGAACTCAACCCCAATGCGACACAAACGCACTCATTTTGTAAACATTCCTCATTACGGTGTTAAGATCTTTGAATTCGCCGTCTCATTGCTTAGCGACAAACTCAAAGATCGTGTGATGTTCCATCGCTCAGCTGAAGAATTGTCTAAATTCGTCGACCCTGCTATATTACCGAAAGAATATGGCGGAACAGTTCCATTAAAAGACATGATCGACGAACTCAAACGAAAACTTTTGAAACACAGAGAAGAGTTACTCGCCTTGGACGATATGTGCATCGATCTGTATGCGCTCGAGAAAAATGATTTAACTCAAGACTTACACTCAACTGCTGGGTCGTTTAGGAAGCTAGAGCTAGATTAA

Protein sequence:

>DPOGS213497-PA
MATSVLQPEECRRDHAPKPDIDPRITEDERLAILEAKERNTGDLPPELREAARIDIREETALKEHALTQMRHFIEKHPAIKKCRTDAPFLLRFLRTKKYSIPQACSMLERYLTIRQMYPQWFQKLDPLDPKIAAVIEAGYLLPLPKRDAEGRRVVLSCMGRFDPYKFDNCVMARVHSMIVELLLDEPRSQLLGYTHVNDEAGMQMPHVSLWSLNDVRIMLNCIQNSTPMRHKRTHFVNIPHYGVKIFEFAVSLLSDKLKDRVMFHRSAEELSKFVDPAILPKEYGGTVPLKDMIDELKRKLLKHREELLALDDMCIDLYALEKNDLTQDLHSTAGSFRKLELD-
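
The transcript and protein records above should agree: a 1032 bp coding sequence holds 344 codons, that is 343 amino acids plus a stop codon, which this record prints as a trailing `-`. A minimal Python sketch for checking that relationship follows. It assumes the standard genetic code and assumes the transcript is a complete coding sequence starting at the first base; the names `translate` and `expected_protein_length` are hypothetical helpers written for this illustration and are not part of the database.

```python
from itertools import product

# Standard genetic code in the conventional TCAG ordering.
# Assumption: the standard code applies to this nuclear gene.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence codon by codon.

    Stop codons are written as `stop_symbol` to match the trailing
    '-' used in the protein record above.
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        protein.append(stop_symbol if aa == "*" else aa)
    return "".join(protein)


def expected_protein_length(cds_bp):
    """Amino acid count for a complete CDS that ends in one stop codon."""
    return cds_bp // 3 - 1


# First five codons of DPOGS213497-TA, copied from the record above.
print(translate("ATGGCGACATCAGTC"))   # MATSV
# Last six codons of DPOGS213497-TA, including the TAA stop.
print(translate("AAGCTAGAGCTAGATTAA"))  # KLELD-
print(expected_protein_length(1032))  # 343
```

Passing the full DPOGS213497-TA sequence to `translate` and comparing the result with the DPOGS213497-PA entry is a quick consistency check on a downloaded record.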