Monarch geneset OGS2.0

DPOGS204685
Transcript: DPOGS204685-TA; 2976 bp
Protein: DPOGS204685-PA; 991 aa
Genomic position: DPSCF300170 + 20987-29855
RNAseq coverage: 216x (Rank: top 45%)
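
The transcript and protein lengths listed above can be cross-checked against each other. The following is a minimal sketch, assuming the transcript is a complete coding sequence that includes its stop codon and that the genomic coordinates are 1-based and inclusive; the record states neither, and the function names are illustrative only.

```python
def expected_protein_length(cds_length_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by a coding sequence of the given length."""
    if cds_length_bp % 3 != 0:
        raise ValueError("coding sequence length is not a multiple of 3")
    codons = cds_length_bp // 3
    # The stop codon occupies one codon but adds no amino acid.
    return codons - 1 if has_stop_codon else codons


def genomic_span(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive coordinates."""
    return end - start + 1


# Values copied from the record above.
print(expected_protein_length(2976))   # 991, matching the listed protein length
print(genomic_span(20987, 29855))      # 8869
```

Under these assumptions the gene spans 8869 bp of the scaffold while the transcript is 2976 bp, which is consistent with the gene containing introns.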
Annotation
Heliconius: HMEL010529; 0.0; 66.90%
Bombyx: BGIBMGA010132-TA; 0.0; 65.95%
Drosophila: Hcs-PA; 1e-135; 30.79%
EBI UniRef50: UniRef50_Q9VNC3; 2e-133; 30.79%; Holocarboxylase synthetase n=12 Tax=Drosophila RepID=Q9VNC3_DROME
NCBI RefSeq: XP_395226.2; 2e-142; 31.95%; PREDICTED: similar to CG14670-PA, partial [Apis mellifera]
NCBI nr blastp: gi|350420827; 3e-139; 31.84%; PREDICTED: biotin--protein ligase-like [Bombus impatiens]
NCBI nr blastx: gi|345493341; 2e-142; 32.38%; PREDICTED: biotin--protein ligase-like [Nasonia vitripennis]
Group
Gene Ontology: GO:0006464; 9e-139; protein modification process
 GO:0004077; 9e-139; biotin-[acetyl-CoA-carboxylase] ligase activity
 GO:0003824; 2.3e-12; catalytic activity
KEGG pathway: ame:411759; 6e-142
 K01942 (HLCS) maps-> Biotin metabolism
InterPro domain: [418-992] IPR004408; 9e-139; Biotin--acetyl-CoA-carboxylase ligase
 [758-858] IPR004143; 2.3e-12; Biotin/lipoate A/B protein ligase
Orthology group: MCL13928; Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS204685-TA
ATGACTTTCATCAAAACTTGGCGCATTCGTTCATTCAGATCTCGATTATTGGAGTTCTTGAGTGGTAACACAAGTGTGGCCTTCTGTAAAGTTCCCGAAAATGCTCCAGAGACCCCAACGCTCAGTTCAAAATTATGTACTAACCCAAAGAATGCATATCTATGTGATCTCCTTGATTATACTAATGCATCACGGATATGCAGCATGAAGCCCACACAATGCATTGATATAAACGACTGGGTGGCATGTACTGGTGGACTGCCTTTAGGAAACGAATACAATTTACCTATCGATGATCTTAATGTTGAAGTGCTTTTTGAATGCGAGATGGATCCGCATGGGGATAGCTCATATCAAAATATAATAACTTTGGAATCGATGGGTCAACCTATAGCTTGGAAGGCCGGTTCGCATTTAGCTATTGTATTAAAAACGAACATTGAGCATCTGAGTCTGGTCGGCTTCCATTTTGTTGGTGGGGAATTCATTATTGATCACAAGTTGCCCGTTGTTAGGATACAAACTGTCAAACTTTCCGGGGAGCCAAATAAAGCTGTCGACTTATCAATGCGAGGACTCCGTAGTGAAATTCAGCATAGTTTGAAATTATCCCACGAGGAATTCCAATGGCCTCTCCACGCGGCGCTCTTAAGGAAACTTTCGAAGGTCGTTGACATTAAGTCAGAAAGAAGGAGCATAGAAATGAATGGGGGCAGTGATAAGGTTATAGACACGTCACCAAGAAAGAAAGTCACGCTTCACGATGTCAGTACAATCACAACTAAATCACTGGGAGAGGAAGAAAAGCCGGATGTATCGAAAGAATTGAGTGCAGTTGAAGTGAAGCCGGAGAAAACTGAGATACTTGTCGAGAAAATTGTGAAGGAAATCGTCCATAAATCACCAGAAAGGAAAATAAGGAGGTCTCCAGAAAAGATCATAAACAAATCACCAGAAAAGAAGCTGGTGGTGCCAGAAAACGGTGAACTAGTAGTGGAAAACGTGACCGTAGAAATACAACATAACATAACACCAAAAAAATCCAGTCTCAAGTTGGATCTCAAGAAGGGCACGAGCCAAGCTTCCTCGTCGCTACTCAACATCTCCAGTCAAATTAAATCAATTACGACACACAAAAGAGTAGTCAGCACGACCGCGCCGAGACGTCCACAGAGTTCGGCTTCAAATTATTCAAACGTTAAAGAGGATACACCAAAGAGTTCGCCCTCGCAGATGACGCCCAAGTCAGCCAAAAGTTCCCGTCCGTTAGTGAATACTAAGTCCCTGAACGTATTAATATATTCCGATAGTGTAGTGGCTAGGGATAACCTGGAATCGTCCCTGAGGAAAGTCTTAGACTGTGATCGGTATACAGTATATAGCGTGTCTGCGGCTTCTCTCCTTGGAGGTGCGTGGCGTGGGCGGGCGGCGTTAGTGGCAGTGGCAGGTTGTGCTGGAAGGGCGGCACCCTTACTACTGGCACACCTGTTAGACGGCGGAAGACTTCTGGCTCTCTGCTCCGACCTGCTGCACACTGTGCTACCTCACTATAAGACCGCTGAGATAAGGGAAAACGAGGTGGTTCAATTCTCATACGACAAATGGGAGTCGGTTAAAATGAAACATCACATATTCTGCTACCAGGCGTCGCCAGCAAAGAAACAGTTCTCAACTGAGAGCGACAGACAGCCATCCAAATACACGGGTCATCCAGGTCATGAGCTGGATATCCAGGTGTTAGCTTCTGAAGAAACCTGGAGGACGCCATCACTGCTGCAAGCCACTGACACAACTAACAAAGGAATGGCCATTTTCTCGCAGATCCACTTAGAAGCTGATCCCTGTGACTATCTCGGGGAGGAAGGTGTGAGGAGCAACGAGGCCAGGATTGAAATACTCCACAAGATACTAGGAGACGTTCTCGGCCTACACGTTAAAGATCCCAAAGAACTCACCACCGTCGAATACACCAGGGGATACTTCCTGGGAAACCACCTGCA
AAAGATCTCGCTAGTGGAGGCGTGGTGTGGGGGTGTCGGGGGTGCGACGAGGGTGCAGGATGTGGGGGGTGTGAGGGTGCAGTGGTGCGGGAGGGGGGAGGAGGGTGATCCCCCCGGGGATGCCAGCCTGCCGGTGTCACTCTACGAGTGTCCAGAGAACTTTGACACTGTTGAATATTTTGATAATCTTACCTCAAGATCGCTGGGCCGCCTTTTGATATACGCGGATGTGTTGGCTAGCACTCAATTCATCAGCGGATGTTCGCTGGCTCATGGCGTGGGGGCGACTGCCAGGAGACAGGTCAAGGGTCGCGGAAGAGGAGGAAACACCTGGATCACACCACACGGACAGGCCGCCATTACTATATGGCAGAAGATTTCGCCCTTACTACCATTATACCAGCACGCGGCTGCGCTCGCCGCCACGAGGGCTGTACGACTACAACACGGATACGATCAGATAGACATACGTATAAAATGGCCGAACGACATCTACTACGGGCGGGAGATGAAGGTGGGCGGCACCATCGTCACGGCGAACTGCATCGGCGATGACGTCATCATCGGCATCGGCACGGGTATGAACGTCTCTAACAAGGTGCCGACGACCTGTCTCGATGATATAATAACGGATTACAACAAGAGCCGCGGCACATCACTAGCGTTGATATCGATAGAAAAGTTCCTCGCCCGATACTGTTCAGAACTGGAAATTATTCTGGATGATCTTGAGAAAAATGGCGTGGAGGGATTCCTTGAGGATTATTATAAATACTGGATGCATTCTGGCGATGAGATCACAATAACCGGCGCCAGCGGGGCCGCCGTGCCCGGGGTGGTCAGAGGCGTCGACGATGCCGGCTGGCTGTTGGTAGAGACGGGAGGCGAAGCCGGGGGGCGAAGCGATATAGTGCGGGTGGCGCCCGACGGCAACACCTTCGATATCATGGCCGGCCTCATTGCACCCAAGTGCTAG

Protein sequence:

>DPOGS204685-PA
MTFIKTWRIRSFRSRLLEFLSGNTSVAFCKVPENAPETPTLSSKLCTNPKNAYLCDLLDYTNASRICSMKPTQCIDINDWVACTGGLPLGNEYNLPIDDLNVEVLFECEMDPHGDSSYQNIITLESMGQPIAWKAGSHLAIVLKTNIEHLSLVGFHFVGGEFIIDHKLPVVRIQTVKLSGEPNKAVDLSMRGLRSEIQHSLKLSHEEFQWPLHAALLRKLSKVVDIKSERRSIEMNGGSDKVIDTSPRKKVTLHDVSTITTKSLGEEEKPDVSKELSAVEVKPEKTEILVEKIVKEIVHKSPERKIRRSPEKIINKSPEKKLVVPENGELVVENVTVEIQHNITPKKSSLKLDLKKGTSQASSSLLNISSQIKSITTHKRVVSTTAPRRPQSSASNYSNVKEDTPKSSPSQMTPKSAKSSRPLVNTKSLNVLIYSDSVVARDNLESSLRKVLDCDRYTVYSVSAASLLGGAWRGRAALVAVAGCAGRAAPLLLAHLLDGGRLLALCSDLLHTVLPHYKTAEIRENEVVQFSYDKWESVKMKHHIFCYQASPAKKQFSTESDRQPSKYTGHPGHELDIQVLASEETWRTPSLLQATDTTNKGMAIFSQIHLEADPCDYLGEEGVRSNEARIEILHKILGDVLGLHVKDPKELTTVEYTRGYFLGNHLQKISLVEAWCGGVGGATRVQDVGGVRVQWCGRGEEGDPPGDASLPVSLYECPENFDTVEYFDNLTSRSLGRLLIYADVLASTQFISGCSLAHGVGATARRQVKGRGRGGNTWITPHGQAAITIWQKISPLLPLYQHAAALAATRAVRLQHGYDQIDIRIKWPNDIYYGREMKVGGTIVTANCIGDDVIIGIGTGMNVSNKVPTTCLDDIITDYNKSRGTSLALISIEKFLARYCSELEIILDDLEKNGVEGFLEDYYKYWMHSGDEITITGASGAAVPGVVRGVDDAGWLLVETGGEAGGRSDIVRVAPDGNTFDIMAGLIAPKC-
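
The protein sequence corresponds to a direct translation of the transcript. The sketch below illustrates this on the first 24 and last 15 nucleotides of DPOGS204685-TA, assuming the standard genetic code (the record does not name a translation table) and using "-" for the stop codon as the protein record does. The `translate` helper and its parameters are illustrative, not part of any MonarchBase tooling.

```python
from itertools import product

# Standard genetic code, listed in TCAG order for each codon position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str, stop_symbol: str = "-") -> str:
    """Translate an in-frame coding sequence, writing stop codons as stop_symbol."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = (CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))
    return "".join(stop_symbol if aa == "*" else aa for aa in residues)


# First 24 nt and last 15 nt of the transcript above.
print(translate("ATGACTTTCATCAAAACTTGGCGC"))  # MTFIKTWR
print(translate("GCACCCAAGTGCTAG"))           # APKC-
```

Both fragments match the start (MTFIKTWR) and end (APKC-) of DPOGS204685-PA. Translating the whole record would mean joining the sequence lines of the FASTA entry into one string before calling `translate`.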