Monarch geneset OGS2.0

DPOGS210491
Transcript: DPOGS210491-TA (975 bp)
Protein: DPOGS210491-PA (324 aa)
Genomic position: DPSCF300186 - 243629-247532
RNAseq coverage: 171x (Rank: top 50%)
Annotation
Heliconius: HMEL016338 | 6e-107 | 67.10%
Bombyx: BGIBMGA012582-TA | 6e-132 | 91.02%
Drosophila: Cpsf73-PA | 7e-156 | 82.11%
EBI UniRef50: UniRef50_Q9VE51 | 9e-154 | 82.11% | Cleavage and polyadenylation specificity factor 73 n=31 Tax=Eumetazoa RepID=Q9VE51_DROME
NCBI RefSeq: XP_395515.2 | 7e-163 | 85.40% | PREDICTED: similar to CG7698-PA [Apis mellifera]
NCBI nr blastp: gi|307177772 | 2e-162 | 85.08% | Cleavage and polyadenylation specificity factor subunit 3 [Camponotus floridanus]
NCBI nr blastx: gi|380012076 | 2e-154 | 85.40% | PREDICTED: cleavage and polyadenylation specificity factor subunit 3-like [Apis florea]
Group
Gene Ontology: GO:0016787 | 9.6e-16 | hydrolase activity
KEGG pathway:
InterPro domain: [30-210] | IPR001279 | 9.6e-16 | Beta-lactamase-like
InterPro domain: [244-317] | IPR022712 | 2.7e-12 | Beta-Casp domain
Orthology group: MCL14908 | Single-copy universal gene

Nucleotide sequence:

>DPOGS210491-TA
ATGACTACTAACCCCAGAAGAGGCACCGATCCCGTGCCAATGGAAGAAAGTGATCAACTCACTATAAGACCATTGGGAGCTGGTCAAGAAGTGGGCAGATCCTGCATCATGTTAGAATTTAAAGGAAAGAAAATAATGTTGGATTGTGGAATCCACCCCGGCCTGTCGGGGATGGACGCTTTACCATTTGTGGATCTCATAGAAGCTGATGAAGTGGATCTCTTGTTGATATCACATTTCCACCTGGACCACAGCGGGGCGCTGCCCTGGTTCCTCACCAAGACCTCGTTCAAGGGCCGCGTGTTCATGACGCACGCCACTAAGGCGATCTACCGCTGGCTTGTCTCGGATTATATTAAAGTTAGCAACATATCCACGGAGCAGATGTTGTACACGGAGTCCGACCTGGAGGGGTCCATGGATCGTATAGAGACCATCAACTTCCACGAGGAGAAGGACGTGAGGGGCGTGAGGTTCTGGGCGTACAACGCGGGCCACGTGCTGGGGGCGGCCATGTTCATGATAGAGATCGCCGGAGTCAAGGTGCTGTACACGGGCGACTTCTCGCGGCAGGAGGACAGACATCTGATGGCCGCGGAGATCCCCACCGTACACCCGGACGTGCTGATAACGAAGAGAGAGGAGCGAGAGAGTCGCTTCACCACGCTGGTGAGCGACGTGGTGGGCCGCGGGGGGAGGTGCCTCATACCCGTGTTCGCGCTGGGCAGGGCGCAGGAGCTGCTGCTAATACTGGACGAGTACTGGTCGCTGCACCCGGAGCTGCAGGACATCCCCATATACTACGCGTCTTCTCTCGCCAAGAAGTGCATGGCGGTGTACCAGACCTACGTCAACGCCATGAACGACCGCATCCGGAGACAGATCGCCGTCAACAACCCCTTCGTCTTCAGGCACATATCTAATCTGAAGGTGGGTGCGGAGGCTCCGCGAATTACAAACACGCTGTCCTGCTAG

Protein sequence:

>DPOGS210491-PA
MTTNPRRGTDPVPMEESDQLTIRPLGAGQEVGRSCIMLEFKGKKIMLDCGIHPGLSGMDALPFVDLIEADEVDLLLISHFHLDHSGALPWFLTKTSFKGRVFMTHATKAIYRWLVSDYIKVSNISTEQMLYTESDLEGSMDRIETINFHEEKDVRGVRFWAYNAGHVLGAAMFMIEIAGVKVLYTGDFSRQEDRHLMAAEIPTVHPDVLITKREERESRFTTLVSDVVGRGGRCLIPVFALGRAQELLLILDEYWSLHPELQDIPIYYASSLAKKCMAVYQTYVNAMNDRIRRQIAVNNPFVFRHISNLKVGAEAPRITNTLSC-
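
Length check: the transcript and protein lengths listed in this record can be cross-checked with simple arithmetic. The sketch below assumes the transcript is a pure coding sequence with no untranslated regions, which the record does not state explicitly. The function name `expected_cds_length` is hypothetical and used for illustration only.

```python
def expected_cds_length(protein_aa: int, include_stop: bool = True) -> int:
    """Return the coding-sequence length in bp implied by a protein length in aa.

    Assumes three nucleotides per codon and, optionally, one terminal stop codon.
    """
    codons = protein_aa + (1 if include_stop else 0)
    return 3 * codons


# Values taken from the record: 324 aa protein, 975 bp transcript.
assert expected_cds_length(324) == 975
```

Under that assumption the numbers agree: 3 x (324 + 1) = 975. This matches the nucleotide sequence ending in the stop codon TAG and the protein sequence ending in "-".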