Monarch geneset OGS2.0

DPOGS215088
TranscriptDPOGS215088-TA894 bp
ProteinDPOGS215088-PA297 aa
Genomic positionDPSCF300187 + 214847-218605
RNAseq coverage1429x (Rank: top 9%)
Annotation
HeliconiusHMEL0217361e-13980.94% 
BombyxBGIBMGA007187-TA2e-13580.81% 
DrosophilaCG8132-PA1e-8454.61% 
EBI UniRef50UniRef50_UPI000224781E1e-9960.00%UPI000224781E related cluster n=2 Tax=unknown RepID=UPI000224781E
NCBI RefSeqXP_001599587.15e-10158.51%PREDICTED: similar to ENSANGP00000002264 [Nasonia vitripennis]
NCBI nr blastpgi|3454929774e-9960.00%PREDICTED: hypothetical protein LOC100114668 [Nasonia vitripennis]
NCBI nr blastxgi|3228003504e-9762.13%hypothetical protein SINV_03107 [Solenopsis invicta]
Group
Gene OntologyGO:00068071.4e-84nitrogen compound metabolic process
GO:00168101.4e-84hydrolase activity, acting on carbon-nitrogen (but not peptide) bonds
KEGG pathwaynvi:1001146342e-100 
 K13566 (NIT2)maps-> Alanine, aspartate and glutamate metabolism
InterPro domain[22-291] IPR0030101.4e-84Nitrilase/cyanide hydratase and apolipoprotein N-acyltransferase
Orthology groupMCL11261 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS215088-TA
ATGTGTGATAGTGCAATAGATGCGGTACATCCGCGATTTTCAAAGAAACCGATGTTTTATCAAACTGGGTTTAAAATAGCGCTCATCCAACTATCTGTTGGCCCGGACAAAGCAAAAAATGTAGCCGCTGCTGTCAGTGAGATACACAAAGCTAAAGCGAAAGGGGCTCATGTAGTAGCTCTGCCAGAGTGTTTCAACTCTCCATATGGAACTAAATATTTCAATGAGTACGCGGAGGAAGTACCGTCGGGGGCCACGAGCCGAGCGTTATCAAGAGCAGCGGCGGAGGCGGGCGTGTGTGTGGTCGGAGGGACCGTGCCTGAGCGGTGTGGCGATAGACTATATAACACTTGTACCGTTTGGGATGACAGTGGGAAGCTACTGGCTCAGTATAGAAAGATGCACCTCTTTGATATAGACATCCCGAACAAGATAACGTTCAAGGAATCCGAAGTACTGAGCGCTGGTGACCAAGTGACTACCTTCGACTACCGCGGAGTTAGAATCGGTATCGGGATATGTTACGACATACGCTTCCCCGAACTCGCGCATCTCATGGCCCAACAAGGGTGTTCCATGTTGCTGTACCCGGGCGCGTTCAATATGACGACCGGCCCCAAGCACTGGGAGCTGCTGGGCCGGGCTCGGGCCAACGATTGTCAGTTGTGGGTGGGCCAGATCAGCCCGGCGAGGGACGCGGCCGCGGGGTACGTCGCCTGGGGACATTCCATCCTCGTCGACCCCTGGGGTCAGGTCAAGGGTCAGCTTGACGAACGACCCGGCGTCATTATCGAGGACATCGATCTGAAGGTAGTTGAAGAAGTCAGGTGTCAAATACCAATAAGAATACAAAGAAGAACCGATGTCTACGACACGGTGTCCGTGAAACAGTGA

Protein sequence:

>DPOGS215088-PA
MCDSAIDAVHPRFSKKPMFYQTGFKIALIQLSVGPDKAKNVAAAVSEIHKAKAKGAHVVALPECFNSPYGTKYFNEYAEEVPSGATSRALSRAAAEAGVCVVGGTVPERCGDRLYNTCTVWDDSGKLLAQYRKMHLFDIDIPNKITFKESEVLSAGDQVTTFDYRGVRIGIGICYDIRFPELAHLMAQQGCSMLLYPGAFNMTTGPKHWELLGRARANDCQLWVGQISPARDAAAGYVAWGHSILVDPWGQVKGQLDERPGVIIEDIDLKVVEEVRCQIPIRIQRRTDVYDTVSVKQ-