Monarch geneset OGS2.0

DPOGS214660
TranscriptDPOGS214660-TA1290 bp
ProteinDPOGS214660-PA429 aa
Genomic positionDPSCF300321 - 113776-123451
RNAseq coverage3494x (Rank: top 4%)
Annotation
HeliconiusHMEL0104759e-8973.66% 
BombyxBGIBMGA001874-TA0.091.40% 
DrosophilaCG3523-PB2e-1325.75% 
EBI UniRef50UniRef50_F4X6Q03e-15473.21%Synaptic vesicle membrane protein VAT-1-like protein-like protein n=13 Tax=Coelomata RepID=F4X6Q0_ACREC
NCBI RefSeqNP_001093281.10.094.07%vesicle amine transport protein [Bombyx mori]
NCBI nr blastpgi|1537922030.094.07%vesicle amine transport protein [Bombyx mori]
NCBI nr blastxgi|1537922030.091.67%vesicle amine transport protein [Bombyx mori]
Group
Gene OntologyGO:00082708.3e-161zinc ion binding
GO:00551148.3e-161oxidation-reduction process
GO:00164918.3e-161oxidoreductase activity
GO:00167477e-38transferase activity, transferring acyl groups other than amino-acyl groups
GO:00054881.9e-13binding
KEGG pathway 
InterPro domain[53-359] IPR0020858.3e-161Alcohol dehydrogenase superfamily, zinc-type
[29-355] IPR0208437e-38Polyketide synthase, enoylreductase
[50-173] IPR0110326.7e-28GroES-like
[163-256] IPR0160401.9e-13NAD(P)-binding domain
[52-104] IPR0131542.8e-11Alcohol dehydrogenase GroES-like
[166-256] IPR0131491.2e-10Alcohol dehydrogenase, C-terminal
Orthology groupMCL16700 Patchy
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS214660-TA
ATGTTTGATGTCGCTTTTGGTCGCGTGTTAGTAATGCTTCACGGCGGCGCCGGAGCCCGGTGTATGACGTGGCGTCGCGACGCCCGGCGGCTGACTAAGTCCAACCCTTGTCTGATTAATTTTTCAACCCCGCCAGGCCGTTGGCTCATTCATTTCGGCCTGAACTTCCAAGACCTGATCGTTCGTCAGGGAGCTATTGATTCTCCTCCCAAGACGCCGTTCATCCTAGGCTTTGAGTGCGCAGGAGAGATCGAACAAGTCGGAGAAGGAGTAACCGATTTTAAGGTTGGAGACCAAGTGGTAGCTCTGCCGGAGTACAAGGCCTGGGCTGAGCTGGTGGCTGTACCCGCCCAGTATGTGTACGTGTTACCCGAGGGCATGTCGGCCTTGGACGCTGTCGCCATCACCACCAACTACGTGGTTGCCTACCTACTGCTCTTCGAAATGGCCAATCTGACACCCGGCAAGAGCCTACTCGTTCACTCCGCTGGGGGTGGCGTTGGCCAGGCGGTGGCTCAGTTAGCGAAGACGGTTGAAAACGTGACCGTATACGGCGTCTGCTCCAAGAGCAAGCACGAGGCTCTCAAGGCCAACAACAACAACATCGACCACCTGCTGGAGAGAGGCAGCGACTACACAAGCGAAGTCAGGAAGTCATCCCCTGATGGCGTGGACATTGTTCTTGATTGTCTGTGCGGCGAGGAGTGCAACCGCGGCTACTCCCTCCTCAAGCCCATGGGACGCTACATTCTTTATGGCTCATCCAACATCGTGACCGGCGAGACTAAGAGCTTCTTCAGCGCGGCGCGTGCTTGGTGGCAGGTGGACAAGGTGTCTCCTATCAAGTTGTTCGATGAGAACAAGAGCCTCGCGGGCTTGAACCTGCGGCACCTGCTGTTCCAGCACGCCCGGGGCGACACCGTCCGCCGCGCCGTGGACAGAGTGTTCGCGCTCTGGAAGCAGGGCCAGGTCAAGCCCCTGGTCGACTCCACCTGGGCCCTGGAAGACGTCGGTGAGGCTATGCAGAAGATGCACGACCGCAAGAACATCGGCAAGCTGGTGCTAGATCCGTCCTTGGAGCCGAAGCCGAAGCCGGCGACACCGGCCAAGGGGAAATCCGGAAAGGAGAAGAAACCAGCCAAGGAGAGCTCCGAGGAGAAGAAGGACAAGGAGTCCAAGGAAGAAGAGAAGAAGAACGAGAACGGTGACAAGAACGGTGAGGAGGTCACTAACGGTAGTGACGAAGAGTCGAAGGAAAAGGAGAAAGAGAAGGAGAAGGAATCTAGCTGA

Protein sequence:

>DPOGS214660-PA
MFDVAFGRVLVMLHGGAGARCMTWRRDARRLTKSNPCLINFSTPPGRWLIHFGLNFQDLIVRQGAIDSPPKTPFILGFECAGEIEQVGEGVTDFKVGDQVVALPEYKAWAELVAVPAQYVYVLPEGMSALDAVAITTNYVVAYLLLFEMANLTPGKSLLVHSAGGGVGQAVAQLAKTVENVTVYGVCSKSKHEALKANNNNIDHLLERGSDYTSEVRKSSPDGVDIVLDCLCGEECNRGYSLLKPMGRYILYGSSNIVTGETKSFFSAARAWWQVDKVSPIKLFDENKSLAGLNLRHLLFQHARGDTVRRAVDRVFALWKQGQVKPLVDSTWALEDVGEAMQKMHDRKNIGKLVLDPSLEPKPKPATPAKGKSGKEKKPAKESSEEKKDKESKEEEKKNENGDKNGEEVTNGSDEESKEKEKEKEKESS-