Monarch geneset OGS2.0

DPOGS210061
TranscriptDPOGS210061-TA2121 bp
ProteinDPOGS210061-PA706 aa
Genomic positionDPSCF300017 - 811470-816521
RNAseq coverage3x (Rank: top 90%)
Annotation
HeliconiusHMEL0029820.054.87% 
Bombyx% 
DrosophilaGC-PA3e-7949.82% 
EBI UniRef50UniRef50_UPI00015B4C911e-17143.19%UPI00015B4C91 related cluster n=1 Tax=unknown RepID=UPI00015B4C91
NCBI RefSeqXP_970819.21e-17645.17%PREDICTED: similar to vitamin k-dependent gamma-carboxylase [Tribolium castaneum]
NCBI nr blastpgi|1892369973e-17545.17%PREDICTED: similar to vitamin k-dependent gamma-carboxylase [Tribolium castaneum]
NCBI nr blastxgi|3838502421e-17346.83%PREDICTED: vitamin K-dependent gamma-carboxylase-like [Megachile rotundata]
Group
Gene OntologyGO:00084884.1e-228gamma-glutamyl carboxylase activity
GO:00171874.1e-228peptidyl-glutamic acid carboxylation
KEGG pathway 
InterPro domain[27-660] IPR0077824.1e-228Vitamin K-dependent gamma-carboxylase
[47-305] IPR0110206.9e-87HTTM
[602-675] IPR0110511.7e-06Cupin, RmlC-type
Orthology groupMCL14879 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210061-TA
ATGGAGAGGTTGCGATATATCCTTTCATGCATGAAATCCGTTACTTACTCCATCTGTAAGACGGTAGATTTGAAATACCAGGAGCAGTTTGGATTTAAACTTAATGAAACAACATGTGAAAAAATTTTAGACTATTTATACGCACCCAAGGACTCGTCAAGCTTGGCAGTCACAAGAATATTGTTTGGTCTGTCTATGATGTTTGATATCCCTGATGAACGAGGAGGGTCCATCATAGACAAGCGATGGGGCGACCCAAATATATGTCACTTCCCTCTGATTCCGTTCATAACGGCAATTCCCATGCCCTACATGGCCGTTATTTACGCCATGCTCTGGATTGGTGCCCTCGGTATCACCCTGGGTTATAAGTACCGTGTAAGCGCCTCGCTGTTTACTCTCTGCTACTGGTACTTGTTTCTTATAGAGAAGAGTTTCTGGAACAATCACAGCTACCTGTTCGGTGTCGTCAGCTTGTTGTTAACCTTCACGCAGGCCAACTCCCACTGGTCCGTCGACGCTTATTTAAATCCTACGATAAGAAAGACGACGGTCCCATATTGGAACTATTTTATTCTGAAATACCAGTTCTTTATCCTGTACTTCATGGCTGGTATGAAGAAAGGCACCGCGGAGTGGCTGACTGGTTATTCGGTTCAGAACCTTAGCGAGCATTGGGTATTCACACCTTTCAAATTATTCCTATCGGTCCCACAGACCGATTACTTCATAGTTCACTGGTTCGTGTTCTCGTTTGATCTGACTGTGGCCGTGTGGATGATGTGGGCGCCGTCAAGAAACATCGCTATGTTGTTTTGTTCTCTGTTCCATCTCATGAACAGTCGACTATTCAGGATAGGAATGTTTCCATGGGTTTGTTTAGCTACAATGCCATTATTCTATCCTTTTGATTGGCCCAGAACAATAATCGGGTATCTGGACAATATAAAATCAAAGTTATTAAAACTGAGCTGCATGGTGTTATACAAAAATGTGGATTTCAAATTGACTCTTAAAGATAATGATGACAGTAAAACAAAAACCAATCAGGAACAGCCTGAAACTGAGGATGTATCAGTGCTTAATGATGGTGATAGCCCGAGCAAAGAAATGCTTACAACAAATGATGGCTTTAATGAAGGTACAGATGAGAAAATAGAAGAAAACAAGACTTCGGTTAATAAACATGACGGAAGATATCTAACTTTGATTTTTGTGATGTTCCACGTTGTATCACAAGCGATTCTCCCTTACTCTCACTTTATTACTAAAGGATATAACAACTGGACGAAAGGTCTCTATGGATATTCCTGGGATATGATGGTACATACATGGGATCTGGAGACAGTTGTGATAAAAGTCGTGGACAATACCAACAACAGAGAATTTTACATTGATCCATACATAAACTCGCCTAATGATCGTTGGACGAGACACGGAGATATGGTCCATCAATATGCCAGATGTCTTAATGAGAAGCTGAGCGCGAGGACCCGGCAAGAAGGAGGTCATAGCGAGTTGAATATATCTATCTTTATGGATATATGGTGTTCCTTAAACGGAAGATTCACTCAGAGAATGTTCGATCCCAAGGTTGACCTGTTGAAAGTTTCTTGGTCCCCCTTCAAACCTGTTTCCTTCCTAATGCCGTTGCTTGACGAAGCTTTGGACTGGAGGGGTACATTGCAGGATATAAAAACCGATGTGCATTCGTGGAATAATTACAGTGATGTTATTTTTTCTGCTGACTTTCCAGGTTACGATCAAGAAAAGTATATACCTTCAGATCTCAGCAATATAACATTAACAATTTTAAATGGTTCCGTCGCGTATGAGCCCGAGGTGACGTCAGGAGGGTACTCGTATAAATTGTCTAGAGGGGACCACATACATTTAAATCCAGACACATTCCATAGAGTTATAAATATAGGAGACACTCCAGCATACTATATGTACACGTTCGCTAATACTTCGGAAATATTAAATATTGTCCCCCCTAAACCCAAATTGCCTGTTTACCAGGAGTTACACAGACGGATAAATAATATGATAAAATTTTGCAAGCTTGTTATATCTAAACTGTTTGAAGTATACTTAAAGATGGAACGTTATTTAATGTGA

Protein sequence:

>DPOGS210061-PA
MERLRYILSCMKSVTYSICKTVDLKYQEQFGFKLNETTCEKILDYLYAPKDSSSLAVTRILFGLSMMFDIPDERGGSIIDKRWGDPNICHFPLIPFITAIPMPYMAVIYAMLWIGALGITLGYKYRVSASLFTLCYWYLFLIEKSFWNNHSYLFGVVSLLLTFTQANSHWSVDAYLNPTIRKTTVPYWNYFILKYQFFILYFMAGMKKGTAEWLTGYSVQNLSEHWVFTPFKLFLSVPQTDYFIVHWFVFSFDLTVAVWMMWAPSRNIAMLFCSLFHLMNSRLFRIGMFPWVCLATMPLFYPFDWPRTIIGYLDNIKSKLLKLSCMVLYKNVDFKLTLKDNDDSKTKTNQEQPETEDVSVLNDGDSPSKEMLTTNDGFNEGTDEKIEENKTSVNKHDGRYLTLIFVMFHVVSQAILPYSHFITKGYNNWTKGLYGYSWDMMVHTWDLETVVIKVVDNTNNREFYIDPYINSPNDRWTRHGDMVHQYARCLNEKLSARTRQEGGHSELNISIFMDIWCSLNGRFTQRMFDPKVDLLKVSWSPFKPVSFLMPLLDEALDWRGTLQDIKTDVHSWNNYSDVIFSADFPGYDQEKYIPSDLSNITLTILNGSVAYEPEVTSGGYSYKLSRGDHIHLNPDTFHRVINIGDTPAYYMYTFANTSEILNIVPPKPKLPVYQELHRRINNMIKFCKLVISKLFEVYLKMERYLM-