Monarch geneset OGS2.0

DPOGS202905
TranscriptDPOGS202905-TA1191 bp
ProteinDPOGS202905-PA396 aa
Genomic positionDPSCF300126 + 195608-199394
RNAseq coverage15x (Rank: top 81%)
Annotation
HeliconiusHMEL0210837e-16372.31% 
BombyxBGIBMGA004178-TA2e-15366.41% 
DrosophilaCG1513-PA6e-14465.09% 
EBI UniRef50UniRef50_A1Z8149e-14265.09%Oxysterol-binding protein n=16 Tax=Coelomata RepID=A1Z814_DROME
NCBI RefSeqNP_610534.32e-14265.09%CG1513, isoform A [Drosophila melanogaster]
NCBI nr blastpgi|3071895269e-14663.92%Oxysterol-binding protein-related protein 9 [Camponotus floridanus]
NCBI nr blastxgi|3071895263e-14363.92%Oxysterol-binding protein-related protein 9 [Camponotus floridanus]
Group
KEGG pathway 
InterPro domain[11-394] IPR0006482.6e-205Oxysterol-binding protein
Orthology groupMCL11842 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS202905-TA
ATGGGTCTGTATTGTATAATATTCGTCACAGCCCTGTATGAGGATGACTCCGACAGCGACCTGGGCTCCATGGAGAACCATGGCTCCGTGGTGACACATCTGCTATCACAAGTCAAGATAGGAATGGATCTCACTAAGGTAGTTCTCCCTACGTTTATACTGGAGAGACGGTCTTTGCTGGAGATGTATGCGGACTACTTCGCACATCCCGACCAGTTTGTCAAGATAGTGGACCAACCGACTCCGAGGGAGAGGATGATACAGGTGGTGAGATGGTACTTGAGCTCATACCACGCGGGCCGGAAGTCGCAGGTAGCTAAGAAACCATACAACCCCATACTGGGCGAGATCTTCAGGTGTCATTGGACTATAGACGGCGAACCGCAAGACAGCAATAGCAAGCAAGAAGTGGGCGACGGCCCTGTACCCTGGTGTTCCCCGGACCAGCTGTCTTTCGTAGCGGAACAGGTGTCACATCATCCACCTATATCAGCGTTCTACTCGGAGCACGTCAATAAAAGAATACAATTCGACGCCTGGGTGTGGACCAAAAGCAAATTCCTCGGACTGTCCATCGGCGTCCATAATATCGGCAGGGGCCTCGTCTCTCTGTTGGATCTGGGAGAGGAGTATTCCCTCACCTTCCCCAACGGATACGGCAGATCTATCCTGACGGTTCCCTGGATAGAGCTAGGCGGGTCTGTGGTCATCGAGTGTGTGCAGACGGGGCACAGGGCCAACATAGAGTTCCTTACGAAGCCGTTCTATGGAGGGAAAAAACACAGGGTCACGTGTGACGTGTTCGTGGGCACCGAGAAGAAGCCGTACTACTCCGCGCAGGGAGAGTGGAACACGAGGGTTGAAGGAAGGTGGACGGACACTGGAATAAATGAATCGAAGAAGAAGCGGGTGGCTCCGATATCGCGCCAGAACAGCGGGGAGTCGAGGAGGGTGTGGAGACACGTGACGGCGGCGCTCAGGGCGGCCGACACGGACGCCGCCACCAGCGCCAAGAGGAGGCTGGAGCAGGCGCAGAGGGACGCCGCCAAGAAGAGGATCGACGACAACCACAAGTGGGAGACGCAGTTATTCAAACCCAAAGGCGACGAAGGCTGGGAGTACACGACGCCTCTCAGCAAACGAATAGAGACATCAAAGTCACCTAAGAGACAGGACACGGCCACTAGATAA

Protein sequence:

>DPOGS202905-PA
MGLYCIIFVTALYEDDSDSDLGSMENHGSVVTHLLSQVKIGMDLTKVVLPTFILERRSLLEMYADYFAHPDQFVKIVDQPTPRERMIQVVRWYLSSYHAGRKSQVAKKPYNPILGEIFRCHWTIDGEPQDSNSKQEVGDGPVPWCSPDQLSFVAEQVSHHPPISAFYSEHVNKRIQFDAWVWTKSKFLGLSIGVHNIGRGLVSLLDLGEEYSLTFPNGYGRSILTVPWIELGGSVVIECVQTGHRANIEFLTKPFYGGKKHRVTCDVFVGTEKKPYYSAQGEWNTRVEGRWTDTGINESKKKRVAPISRQNSGESRRVWRHVTAALRAADTDAATSAKRRLEQAQRDAAKKRIDDNHKWETQLFKPKGDEGWEYTTPLSKRIETSKSPKRQDTATR-