Monarch geneset OGS2.0

DPOGS210547
Transcript: DPOGS210547-TA; 1707 bp
Protein: DPOGS210547-PA; 568 aa
Genomic position: DPSCF300304 - 24249-27949
RNAseq coverage: 16x (Rank: top 81%)
Annotation
Heliconius% 
Bombyx% 
Drosophila% 
EBI UniRef50%
NCBI RefSeq: XP_392463.2; 6e-07; 36.67%; PREDICTED: similar to Choline O-acetyltransferase (CHOACTase) (Choline acetylase) (Acetyl-CoA) (ChAT), partial [Apis mellifera]
NCBI nr blastp: gi|296220123; 6e-07; 40.00%; PREDICTED: LOW QUALITY PROTEIN: choline O-acetyltransferase-like [Callithrix jacchus]
NCBI nr blastx: gi|354465860; 2e-06; 41.98%; PREDICTED: choline O-acetyltransferase-like [Cricetulus griseus]
Group
Gene Ontology: GO:0008415; 6.4e-16; acyltransferase activity
KEGG pathway: rno:290567; 1e-06
 K00623 (CHAT) maps-> Glycerophospholipid metabolism
InterPro domain: [36-94] IPR000542; 6.4e-16; Acyltransferase ChoActase/COT/CPT
Orthology group: MCL22666; Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210547-TA
ATGGGAAGCATATTACTTGTCAACTCAACAGGAATGTTAGAAAAGTCCAAGGAACCGTATTTACTAGAAGCGAAACCAGTTACGAAACAATACGTCCAAGTGGCAAAGCTGCCCGTGCCCGAGCTTCAAGCTACCCTCGACTCGTACTTGGACTTCGCGGCCGTCATAGTGAACCATCATCAGCTTCAGAAGACGCAGGAGATCGTAAAGAAGTTCGCCGAGGATCTGGGACCCAAGATGCAGAACGTACTTTTGGATCGACAGAAAGAAATGATTAACTGGTGTCGCGGCTTAGTCGAGTTTCATAAACCAAACGCCAGTGACAACACGCTACTGTTGAAGGTGCTGCAGACCATAGACGCCCCGACTTCAGACACATACTTATTTGACGTGACTTTGAAATTATTACAAAGTGTGTGTGCGTGTATGCATGAGACATTACATAAGAAACAGGAGATATTCATCGATGACACTAACTCTATGCACTTAGCAGATGAAAAGCAAGGAAACGATATAAATCTGTACAAAGAGATATACAATAATACGTGTATTAACTTACGACATGAAATAGTTAACAAAATCAGAAGACATGAACTAGAAAATAACTATGAGAACAAACAAAGACAAGATAATGACTCAGCTCCGAATAGAAATATGATGTCTATTGATGATAACGCGAAGGATTCAATGAAGAAAATTATATTCGATACATTTATAAGAAATAACGATCACCACAAATACAGAATAGAAAAACACAATTTATTTAATAAAGACCTGTCATTGAAAAATAAAGTGGAGCACAGGGTTATTATAATAATAACATTGTATCCTCAAGAGAACAAAACTAAATTTGATATTAAACAAAGCAACATCACTGAAAAAGAAATCAACGACACTCTCGCGAACGACTCGCCGATAGATATAGACATATTAAGTGGCATTTATATTAAACGAAACAGCGTGAAATCAAAAGTAACGAATGTCAGAAAGTGCGACCTGTTTTCTATATACCTTTTTCTCATCAAGAGCATGAAAAGAAATGATGTTAAAGTGTTCAAACTTTTAAGAAGAAAAGGAACGATTAAAATATACATAGACACGACAACTTCTAAACAATTTAAAGATTTCCCTAATCCGTGCTGCGAGAACAGCACCTTGTTGAAAGCGTGTAAGAATGAAATTATATTTGAAAATAATGCTTCAGAAAATAAATATCGAACATCGACCGATAAACACATAGAAGGACATTTAAATGAAATCGTTTCTAAAATAAAATTACTCGTGAAACAATATGGAAAAAAAATATATCAAAATGGCAATCAAATCGCCAATACCACAAAGCCTTTACTTACCACAGAACATCCTGATGACGAAACTGCTGGTGGTCATTTTGAAAAGCAAAATATTAGTGAGCAAATATTTGACACGAGATCGCCATTGTATAATGAAATAACATCGGACCATATTTTTTCACGAACGGCCAATAGTCAGGACAACCTTACATCTACAAATATGATAACTGAAGCATCTTTAGGTAACGTTGGATATGATGTTAAGGACATCGAAAGTCGAATAACGCTCAACGATCAGTCTTACTTCAAACTGAGAGTTATTCGAACAACTTGTACGTTACCGCTTTATTTGAAGAAGAAACAGAAAATGTGCAGTACTTTTCCAAAAGTAAACATCCTTTTACCCAGAGTTTAA

Protein sequence:

>DPOGS210547-PA
MGSILLVNSTGMLEKSKEPYLLEAKPVTKQYVQVAKLPVPELQATLDSYLDFAAVIVNHHQLQKTQEIVKKFAEDLGPKMQNVLLDRQKEMINWCRGLVEFHKPNASDNTLLLKVLQTIDAPTSDTYLFDVTLKLLQSVCACMHETLHKKQEIFIDDTNSMHLADEKQGNDINLYKEIYNNTCINLRHEIVNKIRRHELENNYENKQRQDNDSAPNRNMMSIDDNAKDSMKKIIFDTFIRNNDHHKYRIEKHNLFNKDLSLKNKVEHRVIIIITLYPQENKTKFDIKQSNITEKEINDTLANDSPIDIDILSGIYIKRNSVKSKVTNVRKCDLFSIYLFLIKSMKRNDVKVFKLLRRKGTIKIYIDTTTSKQFKDFPNPCCENSTLLKACKNEIIFENNASENKYRTSTDKHIEGHLNEIVSKIKLLVKQYGKKIYQNGNQIANTTKPLLTTEHPDDETAGGHFEKQNISEQIFDTRSPLYNEITSDHIFSRTANSQDNLTSTNMITEASLGNVGYDVKDIESRITLNDQSYFKLRVIRTTCTLPLYLKKKQKMCSTFPKVNILLPRV-
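
Length check:

The transcript and protein lengths listed above can be cross-checked with simple arithmetic. The sketch below is illustrative only and not part of the record. It assumes the 1707 bp transcript is entirely coding sequence and ends in a stop codon, which fits the nucleotide sequence starting with ATG and ending with TAA. The function name `expected_protein_length` is hypothetical.

```python
def expected_protein_length(cds_bp: int, includes_stop: bool = True) -> int:
    """Return the number of amino acids encoded by a coding sequence.

    Assumes the whole sequence is coding (no UTRs) and is in frame.
    """
    if cds_bp % 3 != 0:
        raise ValueError("coding sequence length is not a multiple of 3")
    codons = cds_bp // 3
    # The stop codon occupies one codon but encodes no amino acid.
    return codons - 1 if includes_stop else codons


# 1707 bp -> 569 codons -> 568 amino acids plus one stop codon
print(expected_protein_length(1707))  # 568
```

The result agrees with the 568 aa given for DPOGS210547-PA. The trailing "-" in the protein sequence corresponds to the stop codon.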