Monarch geneset OGS2.0

DPOGS207496
Transcript: DPOGS207496-TA (1647 bp)
Protein: DPOGS207496-PA (548 aa)
Genomic position: DPSCF300051 + 773512-777762
RNAseq coverage: 791x (Rank: top 16%)
Annotation
Heliconius: HMEL012329  9e-92  92.09%
Bombyx: BGIBMGA009838-TA  0.0  66.60%
Drosophila: retm-PA  2e-93  60.74%
EBI UniRef50: UniRef50_E0V9H6  1e-116  73.09%  Putative uncharacterized protein n=1 Tax=Pediculus humanus corporis RepID=E0V9H6_PEDHC
NCBI RefSeq: XP_319779.4  1e-133  49.35%  AGAP009029-PA [Anopheles gambiae str. PEST]
NCBI nr blastp: gi|242003549  5e-116  73.09%  conserved hypothetical protein [Pediculus humanus corporis]
NCBI nr blastx: gi|242003549  3e-113  73.09%  conserved hypothetical protein [Pediculus humanus corporis]
Group
Gene Ontology:
    GO:0006810  2.4e-14  transport
    GO:0016021  2.4e-14  integral to membrane
KEGG pathway:
    uma:UM04979.1  4e-08
    K01101 (E3.1.3.41) maps-> gamma-Hexachlorocyclohexane degradation
InterPro domain:
    [93-282]  IPR001251  8.5e-65  Cellular retinaldehyde-binding/triple function, C-terminal
    [14-105]  IPR011074  5.4e-25  Phosphatidylinositol transfer protein-like, N-terminal
    [390-531]  IPR009038  2.4e-14  GOLD
    [46-87]  IPR008273  2.7e-10  Cellular retinaldehyde-binding/triple function, N-terminal
Orthology group: MCL12001 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS207496-TA
ATGACCGAAATACCACGCCAGCCACAGCGGCCGCGGAATTGTGATTTACCCCGGAACATCGATTTAGATCCATTCAGCCTGGACAGCGACTACATCAAGAGGTACCTCGGCGAGCTGACGCCCACCCAGGAGTCGAGGCTGCTGCAGCTCAGGAAGTGGATCGCGGAACTGCAGAAGGGGAAGGTGCCGAGTGACACGACGCTGCTTCGTTTCCTTCGAGCTCGAGACTTCAGCGTGGAGAAGGCTCGTGAGATGCTCTCCCAGTCCCTACTCTGGCGCAAGAAGCACCAGGTGGACAGGCTGCTGTCGGAGTACGAGACGCCGGAGGTCGTGCGGCAGTACTTCCCCGGGGGGTGGCATCACCATGACAAGGACGGACGTCCGCTTTACATACTCAGACTTGGACAAATGGACGTAAAGGGCTTGTTGAAATCCATCGGAGAAGATGGCCTCTTGAAGTTGACGCTTCACGTGTGTGAAGAGGGCCTGAAACTGCTGGAGGAGGCGACCAGGTCGTCGGAACACGCCATACAGTCGTGGTGCCTGCTGGTGGATCTGGACGGGCTGAACATGAGGCACCTGTGGCGGCCCGGGGTCAGGGCTTTGCTGAGGATCATCCAGATCGTGGAGGCCAACTACCCGGAGACCATGGGCCGGGTCCTCATCGTCCGGGCGCCCAGGGTCTTCCCAATACTATGGACCATCGTTAGCACTTTCATAGATGAAAACACCCGCAGTAAGTTCTTGTTCTACGGCGGCAAGGACTACCTCCAGCCCGGGGGTTTGCTCGATTACATCCCCAAAGACCTCATCCCAGACTTCCTCGGGGGACCTTGCAAGTCGGACATAGTGGACGCGTCGCTGTACGTGCCGTCCCACTATTACGACTCTGTACGACGTGAGACTATAATATACGCACTCTGCCACTGTGGCGTGCAACGCATCAATGTGGCGTGCAATAACATGTGCAACACTGTTGTCCGTACCGTCCCGTCTGTGTCACAGATCATGGTTGCACCGATGCATGGCTGCGTGTTCACTAGACGCGTGCGAGTCCCCAGGAGCTCTCATAGTATTATGTTTAGTTTCGTCCACGAGGGCGGGCTGGTTCCGAAGAGCTTGTACGTCAGCGGCGCCTTCACTGAGCGCGACGGAGACCCCCTCAGTGAAGACAGCATCTACAAATCGGTCAGCCTGGCGAGAGGACAGGTGCACGAGGTGGTGGTGCACAACCGCGACCCGCACAGCGTGCTGACGTGGGACTTCGACGTGTTACGGCACGAAGTGTACTTCAGCGTGTTCAGGAGCACCAAGGAGCTGCAGCCTCCGGAGCAGCCCGCCGCCGGCGACGAGAGCCGCTCTGTCCTGGAGGGTGCTGGTCGCGAGGGTGAACACTACCACCGCGTGGAGACGTCGCTGCTGTGTCACGACGGAGAGAGCATACAGGGTTCCCACGTGATGTCATCGCGAGGCAGCTACGTCCTCCAATGGCGGTGCGAGGGCGCCCGGGGGCCGGACGGGGCGCAGCTGGTGTACTTTCACGAGACGCTCGCCAGCCACCACTACCGCGGGTCCATGTCGAGCCTGCAGTCCGCCACCAGTGGCTTCTCGTGCCTGAGCGGGTCCAGCTCGTGTCCGTCCCGGTGA

Protein sequence:

>DPOGS207496-PA
MTEIPRQPQRPRNCDLPRNIDLDPFSLDSDYIKRYLGELTPTQESRLLQLRKWIAELQKGKVPSDTTLLRFLRARDFSVEKAREMLSQSLLWRKKHQVDRLLSEYETPEVVRQYFPGGWHHHDKDGRPLYILRLGQMDVKGLLKSIGEDGLLKLTLHVCEEGLKLLEEATRSSEHAIQSWCLLVDLDGLNMRHLWRPGVRALLRIIQIVEANYPETMGRVLIVRAPRVFPILWTIVSTFIDENTRSKFLFYGGKDYLQPGGLLDYIPKDLIPDFLGGPCKSDIVDASLYVPSHYYDSVRRETIIYALCHCGVQRINVACNNMCNTVVRTVPSVSQIMVAPMHGCVFTRRVRVPRSSHSIMFSFVHEGGLVPKSLYVSGAFTERDGDPLSEDSIYKSVSLARGQVHEVVVHNRDPHSVLTWDFDVLRHEVYFSVFRSTKELQPPEQPAAGDESRSVLEGAGREGEHYHRVETSLLCHDGESIQGSHVMSSRGSYVLQWRCEGARGPDGAQLVYFHETLASHHYRGSMSSLQSATSGFSCLSGSSSCPSR-
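
The lengths listed in this record can be cross-checked with a short calculation. The sketch below is illustrative only and is not part of the database. It assumes that the 1647 bp transcript is entirely coding sequence ending in a stop codon (the trailing "-" in the protein sequence), and that the genomic coordinates are 1-based and inclusive. The function names are hypothetical.

```python
def expected_protein_length(cds_bp: int, includes_stop: bool = True) -> int:
    """Number of amino acids encoded by a coding sequence of cds_bp bases."""
    if cds_bp % 3 != 0:
        raise ValueError("coding sequence length is not a multiple of 3")
    codons = cds_bp // 3
    # The stop codon, if present in the sequence, encodes no amino acid.
    return codons - 1 if includes_stop else codons


def span_length(start: int, end: int) -> int:
    """Length of a span, assuming 1-based inclusive coordinates."""
    return end - start + 1


# Values taken from the record above.
transcript_bp = 1647
protein_aa = 548
genomic_start, genomic_end = 773512, 777762

print(expected_protein_length(transcript_bp) == protein_aa)
print(span_length(genomic_start, genomic_end))
```

Under these assumptions, a genomic span longer than the transcript would be consistent with a gene model that contains introns.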