Monarch geneset OGS2.0

DPOGS207051
TranscriptDPOGS207051-TA1848 bp
ProteinDPOGS207051-PA615 aa
Genomic positionDPSCF300001 + 2104320-2113955
RNAseq coverage788x (Rank: top 16%)
Annotation
HeliconiusHMEL0104990.071.33% 
BombyxBGIBMGA012994-TA3e-15661.15% 
DrosophilaCG32751-PA1e-6342.71% 
EBI UniRef50UniRef50_Q8IRR11e-6142.71%Vanin-like protein 2 n=11 Tax=Drosophila RepID=VNNL2_DROME
NCBI RefSeqXP_001963394.15e-6632.59%GF20308 [Drosophila ananassae]
NCBI nr blastpgi|1947625441e-6432.59%GF20308 [Drosophila ananassae]
NCBI nr blastxgi|3320231533e-6035.80%Vanin-like protein 1 [Acromyrmex echinatior]
Group
Gene OntologyGO:00168111.8e-112hydrolase activity, acting on carbon-nitrogen (but not peptide) bonds, in linear amides
GO:00068071.5e-37nitrogen compound metabolic process
GO:00168101.5e-37hydrolase activity, acting on carbon-nitrogen (but not peptide) bonds
KEGG pathwaydpo:Dpse_GA171172e-61 
 K01435 (BTD)maps-> Biotin metabolism
InterPro domain[1-562] IPR0121011.8e-112Biotinidase, eukaryotic
[23-288] IPR0030101.5e-37Nitrilase/cyanide hydratase and apolipoprotein N-acyltransferase
Orthology groupMCL10160 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS207051-TA
ATGAAGAGTATTATTATAATTTTAGCGATTTTATGCGCTGTAGATATTACAGTACAGAGATCGACTCCAGAAGCTGACAGCTATGTTGCAGCTGTTGTTGAATACCAGGTGCAGAGCAATGTGGAGACGAACTTGAGGAATTACATCAATTTAATACAAGATGCTGCATCCCAGAATGCTGACATTGTGGTATTTCCCGAGATGACCTTGACAAGAGGCAACAGCGTTGTCATCCCCATCCATGGCTTGCTTAAAGATAATCCCATACCAGCTTTGGCGCCAGAATTGTATGATGAGATCCTTGTATCAATATCAGCTGCGGCAAGACAGAATGAGATATATGTTGTTATCAACGTTCAAGAGATACTCAATTGTACAAATCCCCAAGCTGAGGGTGAGAACTGCCCAGAACAGAAACAATACTTGTTCAATACGAATGTCGTATTCAACAGGTCGGGAGCGGTTATTGATAGGTATCGAAAAATAAATCTCTTCGGTGAATTCACTCGTACACCGGCACTTTCACCAGATCTTGGGGTGTTCGAGACAGACTTCGGAGTTACTTTCGGTCACTACATTTGCTTTGACCTTATGTTCCAAGTTCCCGCTATACAGGTTGTTGAAAAAATGAACATTACTGACATCGTCTTCAGTACTATGTGGTTTTCTGAAATGCCTTATCTAACCGCCGTCCAGATTCAGCAGGCCTACGCTTACTCCATGAACGTTAATTTCCTTGCCGCTGGAGCAAATAATCCCAGAGTTGGAAGTGCTGGTTCAGGGATCTACTCGGGGAAAGCTGGTGCATTAGTCAGCATCATGCCTGGTCAGCCAACCACAAGATTGTTGGTCGCTACTGTGCCAAAGGTGCCAGGAGAGGTAACCGGTAATGTAACGGGACCAATCTACGACAGTCCATCTATCCAGGATAATCTAACACTTATAACAGACCCTTCGCTGCCCTCACATCAGACAAGACTACTAAGAAATGATGTCGAAGATTTTGTGCTCTTTGATAGAGACGTGCTCTGTCACTTTCGTGTGCGAATGAGTGGGAGAGAGGGGAATATGGCACCATTTTACAGGGCATTTGTTCAAGATGGACTCCATGTTTACGCAAAGCGTAACGTTGGTGATGTTGGTTGCGTTATAGTTGCTTGCAAAACAGAAGATCCCAAAAGCTGCGTTTACAAATTCGATAATAACGAAGGTCACACAAGTATAGAGGAATTAAAGATAACAATGACGTCATACGGAAAACATTATAATTCCACATTGAAATGCAATGACATCAAGTATAGATACAGAGCAAGTGCCTTTAGCGGAGTGAGGGATTTCAGCGGCATGGCCACAGGTGGAGCGAGGGTGTGCGCCATCTTTGCATGCACGGGAGATACTATTGATACTTGTGGGAAACGTTTTGACAATTATTCGGGCAACACTACGGTTGTTTTTGAACAGCTCGAAATCACAGCAGCAGTTCCTACTCCGATTGAAAATAGTGATCTGCAAGCATCAGATTCTAAGTATTTTCCTATATCTCTGACCACATCCATAATGCCTCTGAAGAACGAAGAATTCGCCTTTGATGAAGTATCACTTCTACTTGTCAATGTTTACTCTATGAAGTTGAGTGCAGCTACTGATGAATTATATGCTTTTGGAGTTTGGGGCAGACGGTTTGACACCGATGGCGAAGATCCGTCTCCGCCCAGAGAACATGATGTTGAAGATCTAACTACTACTCCAGCCCCAGCTACTACGCCTTCTACAGGAAGTTCTTACAGCGTTCACAAATTTAGTGCGATTTTGTTCATTATAAGCATTTTTGTTCTATTGCAGCAATAA

Protein sequence:

>DPOGS207051-PA
MKSIIIILAILCAVDITVQRSTPEADSYVAAVVEYQVQSNVETNLRNYINLIQDAASQNADIVVFPEMTLTRGNSVVIPIHGLLKDNPIPALAPELYDEILVSISAAARQNEIYVVINVQEILNCTNPQAEGENCPEQKQYLFNTNVVFNRSGAVIDRYRKINLFGEFTRTPALSPDLGVFETDFGVTFGHYICFDLMFQVPAIQVVEKMNITDIVFSTMWFSEMPYLTAVQIQQAYAYSMNVNFLAAGANNPRVGSAGSGIYSGKAGALVSIMPGQPTTRLLVATVPKVPGEVTGNVTGPIYDSPSIQDNLTLITDPSLPSHQTRLLRNDVEDFVLFDRDVLCHFRVRMSGREGNMAPFYRAFVQDGLHVYAKRNVGDVGCVIVACKTEDPKSCVYKFDNNEGHTSIEELKITMTSYGKHYNSTLKCNDIKYRYRASAFSGVRDFSGMATGGARVCAIFACTGDTIDTCGKRFDNYSGNTTVVFEQLEITAAVPTPIENSDLQASDSKYFPISLTTSIMPLKNEEFAFDEVSLLLVNVYSMKLSAATDELYAFGVWGRRFDTDGEDPSPPREHDVEDLTTTPAPATTPSTGSSYSVHKFSAILFIISIFVLLQQ-