Monarch geneset OGS2.0

DPOGS204491
TranscriptDPOGS204491-TA4152 bp
ProteinDPOGS204491-PA1383 aa
Genomic positionDPSCF300002 + 1265486-1278774
RNAseq coverage858x (Rank: top 15%)
Annotation
HeliconiusHMEL0118640.093.71% 
BombyxBGIBMGA007835-TA0.083.31% 
Drosophilasick-PB0.062.36% 
EBI UniRef50UniRef50_Q9VIQ90.062.36%Protein sickie n=32 Tax=cellular organisms RepID=SICK_DROME
NCBI RefSeqXP_002064963.10.052.39%GK14929 [Drosophila willistoni]
NCBI nr blastpgi|1954339370.052.39%GK14929 [Drosophila willistoni]
NCBI nr blastxgi|2813652330.050.18%sickie, isoform E [Drosophila melanogaster]
Group
Gene OntologyGO:00001663.8e-05nucleotide binding
GO:00171113.8e-05nucleoside-triphosphatase activity
KEGG pathwaydpo:Dpse_GA104760.0 
 K01516 (E3.6.1.15)maps-> Thiamine metabolism
    Purine metabolism
Orthology groupMCL10709 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS204491-TA
ATGGCAGTGACAACGTCTCCAACGACTTTTTATGGATCTCCAATCCACGATGGATTTGCAACTATTCGTGCACCCAGGAGTCGAATAAAGATAAGAAATCTAGTTGAACCAAATACCACATCTAACCTACCTCAAAGGCATTCCGAATATTTTACATTAAATAGATCAGCCAAAAACAACTTCTTAGAGAGCAGTTGCGGAGTGCAGTATGCAACTAATGGGTCACAAAGTATATATGTAGATAAACCTTCAAGAATAAGTTTTACTGAATCTATTTATGCCAAAGTATCTCCACCGCCTTCAAATAGGACGTCTCCGGCAAAATTTCTTAATGGTAGACAGAAAGAAGAAAGCCATTATGATTCTATAGATACTAAGCCTCGTAGGAGATTAAAAAGCCTAGAATTACCGAAAGAGACAGATCAAGAGAATGAGGAAAAAGTTATGATTGCTAGTTTCTTGAATAACGGCGAAGTGAGTCCGTATGAAATGTATATAGCTAACCAGGATACCGCAGAACAACATTCCGTCCGCACGAACACTACTGGATCATCCGTTGACTATAGAAAGAGAAATAGCGACCGGGATAACTGCTCGGAATCATCTTCATACGTAACCGATAACGACAGACCGATATCAAGTTACAGTGATAATTCTACGATACCATCCACCGACACGGAAGACGTATTAAAGGAATTACCGAGCAAATCAAGATTTCATAAATCACCCCAAAAGTATGCGACGTTAAATTTAAGGCGGCCAAAGTTTATAGATCTTAAGCCGCCGGGAGTAAACGATAACCCGTTTTGCGGCAGCCTACATCGAAATAAATTGGGATATCACTCAGAACCAAGCACTCCTTTATCTGGAGACAGTAATGATTTCACCGGCACTAAAAATTTTCAAAGTGGAGTTGAACAAATACCTAAAATGCCGTTACAGTATAATAGAAATGGCTTTAAAAGATCTGTTAGCGAATCAAATGGTTTCTCGAAACGATTAAATTATCGTCATTCCTTCAGTGCAGATTACAAACCACAGTCTGTCGTTAGACGACCTCATAAATGCTGTGAATGTGTAACCGGCGTACCAGCAGAAGATGATATCGATACTTCGCAAACTTCAAGAACTTTAGGAACATTATACGAATCTCAAGACCCAAAAGTTGGATGTCAAACAATCCTTAGATCGAAACCGCCTGTGCCTTGGTGGGAATTGGCAATAAAAAAATCTCGCTACAAGAGCTGTCCTATTTTAGAAGAGGCTCATGTCGTGTCCGCCTTTGAACAAAGTTTATCGAATATGACCCAGAGGCTGCATCAGCTGACGGCAACTGCAGAACGCAAGGATTCTGAGCTAACTGAGCTTCGGCAAACAATTGAGCTGCTTCGGAAGCAATCAATCCAAGCGGGTTTGACGACGGCGCACATGCAGTCCATGGGCATCCGAGCTGATGGCGTCAATGTTACCGGCCAGGAACCACCTCAAAATCAAACTCAACAGTCATCACCACAGAGATTGGCTCAAGGCAATGGTGCTATTACCCGCCACCTCTCTACAGATAGTGTCTCCAGTATTAATAGTTTGAGCAGCGGCTCATCAGTTCCTCACGATAAGAAACACAAGAAGAAAGGATGGTTACGGTCGTCGTTCACGAAAGCATTTTCGAGAAATGCCAAGATATCAAAAACAGCAAAGCATTCGTCCCTCGGGCAGCTGTCGTCACAGGACAGCTCATCGGGATCACACCATTATGATGACCCGCACACGATCCGAGAAGGAAGCAACGAAAATAGTCTCGAACATTCTCACGAAGTTCTTGACAATAGCAAAGATAAAACTACATGCCCTCCAGCCGTCGAAGAACAGACCAAAGAAAAAGCTGACCAAGCCGGTTTAGTGGATGAACTAAAACGGCAATTAAGAGAAAAAGACTTGGTGTTAACAGATATAAGACTGGAAGCGCTCAGCTCAGCGCATCAACTAGAAAGTCTCAAGGACACCGTAATCAAAATGAGGAATGAAATGCTGAATCTGAAACAAAACAATGAGCGGTTACAAAGGTTAGTGACATCTCGGTCGCTGGCCGGCAGTCAGAGCTCGTTAGGTACAGGCGGCTCCGCGGTTGAGGACCCCAGACGGTTCAGTCTGGCTGACCAGGCTACCATGCATCAGGCGGCTATGGACATTCACTCACAACCTTTAGATTTAGATTTCAACTGTATGACGTCTACACCTTCTCGTGATCTATCTAAAAAAGGATCACCCAAGTCTAGTATGATGGAACCAATATACGGTAATAAAGCTGTTTGCGAACTAAACGAAAACAGTGAAGCCTTACTTGGAGCGCTAAATGGTGCTAGTGATATGTTTAGTAATGGTTTAAGCGGCGGAGAAAGGATAAGCGGAGATTATGATATCAATACCGTTTTACCACCACCCAAGAGCCGAGAGCTGGCGATAGGGGAAAGTTACTCAGATATCGGTGTCGCTGATAGTCAAGGCGATACAACGGACGGTAAGAAAATTGCGATAGCTGTGTATTTGGGCCAACCGGAAACATTCCAAAGATATTTCGAGGAAGTGCAAGACACGTTAACCGAATCAGAATGTAGATTTTATGCGAAACAGAGCGCCAACGCTTTTAATAACCATTTCGATAAACAGCCCAGCTTCGATTCACCGAGGATGTCACAGAATCACAGTCCGGAAATAGAAACGCTGGATTACCAACAAACTATAAACAAATCCAACACCAATAGCCTCAAAAGCAATAAATCTACGCACAGTAATTCCTATAAGAACGTATATAATAGTGATTCGACAATAAATTGCAATGAGTTTACTATTGCGTTTACATATATATCTGGCAAAACAACTTGGCAGAATTTAGATTATATAGTTAGGAAAACGTTTAAAGACTACCTGTCGAGGATAGATCCTGGCACGAATCTTGGTCTCAACACCGATTCGATAACGTCCTACCATTTGGGAGAAGCGACGAGAGGTCCGGAGATATGTTTCCCTGAACTACTGCCTTGTGGCTATATAATTGGAACTGTAAATACTCTATACATTTGTCTGCAAGGAGTAGGGAGCTTAGCTTTTGATAGCCTTATACCAAAAAATATAGTTTATAGATACGTTTCGCTTTTGTCGGAACACCGGAGGGTAATTCTTTGCGGCCCGAGCGGTACTGGAAAATCATACTTGGCAGCTAAATTGGCGGAATTTTATGTACAAAGGACGCAGCGAAGAGGAAATCCAGTAGAAGCCGTAGCTACATTCAACGTGGATCGGAAGTCGTGCAACGAATTGCGCGCGTACCTTGGGAACATCGCAGAGCAGTGCAGTGGAGCTGCAGCTGGAGAGGAGGCGCTGCCTGCCGTTGTAGTGCTCGATAATCTGCAACACGCCTCGGCTCTCGGGGACGCCTTTGCGGGTCTTTTGCCGCCAGACAACAGGAACATGCCAGTCATTATAGGTACCATGTCACAAGCAACATGCAATACGACAAATCTTCAGCTACATCATAACTTTAGATGGCTGCTGACCGCCAATCACATGGAGCCAGTTAAAGGGTTCTTAGCAAGATATCTTCGTCGAAAACTATTTTCTCTGGAACTGAGACTGGGTCGCCGCGAGCCAGCCCTTGCAGCGGTTCTGGAATGGCTGCCCGGTGTGTGGTCAACCCTTAATGCCTTTCTGGAAGCGCATTCCTCGAGTGACGTTACCGTCGGACCTCGGCTGTTCCTTGCTTGTCCCATGGACTTAGAAGCTAGCCAGGCTTGGTTTGCGGATGTATGGAACTACAGTATAGTTCCGTACGCATCGGAGGCTGTACGCGAGGGAGTTGCACTGTACGGACGGCGACGACACGCCGCCGTGGACCCTCTACAACACGTCAAGACAACCTATCCATGGAGAGAACCAAACCATTCACATACGTTACGACCGATATCAGTGGAAGACGCAGGTATTGAAGAATCGAATCAAGATGTAACCACTACCAACAATCAAGATCCACTGTTGAATATGCTAATGCGGCTACAGGAAGCAGCTAATTACAGCGGTAACCAAAGTCAGGATTCCGACAACGCTAGCATGGACTCAAATCTAACACATGACAGCTCCATGGGCAACGAGCTTTAA

Protein sequence:

>DPOGS204491-PA
MAVTTSPTTFYGSPIHDGFATIRAPRSRIKIRNLVEPNTTSNLPQRHSEYFTLNRSAKNNFLESSCGVQYATNGSQSIYVDKPSRISFTESIYAKVSPPPSNRTSPAKFLNGRQKEESHYDSIDTKPRRRLKSLELPKETDQENEEKVMIASFLNNGEVSPYEMYIANQDTAEQHSVRTNTTGSSVDYRKRNSDRDNCSESSSYVTDNDRPISSYSDNSTIPSTDTEDVLKELPSKSRFHKSPQKYATLNLRRPKFIDLKPPGVNDNPFCGSLHRNKLGYHSEPSTPLSGDSNDFTGTKNFQSGVEQIPKMPLQYNRNGFKRSVSESNGFSKRLNYRHSFSADYKPQSVVRRPHKCCECVTGVPAEDDIDTSQTSRTLGTLYESQDPKVGCQTILRSKPPVPWWELAIKKSRYKSCPILEEAHVVSAFEQSLSNMTQRLHQLTATAERKDSELTELRQTIELLRKQSIQAGLTTAHMQSMGIRADGVNVTGQEPPQNQTQQSSPQRLAQGNGAITRHLSTDSVSSINSLSSGSSVPHDKKHKKKGWLRSSFTKAFSRNAKISKTAKHSSLGQLSSQDSSSGSHHYDDPHTIREGSNENSLEHSHEVLDNSKDKTTCPPAVEEQTKEKADQAGLVDELKRQLREKDLVLTDIRLEALSSAHQLESLKDTVIKMRNEMLNLKQNNERLQRLVTSRSLAGSQSSLGTGGSAVEDPRRFSLADQATMHQAAMDIHSQPLDLDFNCMTSTPSRDLSKKGSPKSSMMEPIYGNKAVCELNENSEALLGALNGASDMFSNGLSGGERISGDYDINTVLPPPKSRELAIGESYSDIGVADSQGDTTDGKKIAIAVYLGQPETFQRYFEEVQDTLTESECRFYAKQSANAFNNHFDKQPSFDSPRMSQNHSPEIETLDYQQTINKSNTNSLKSNKSTHSNSYKNVYNSDSTINCNEFTIAFTYISGKTTWQNLDYIVRKTFKDYLSRIDPGTNLGLNTDSITSYHLGEATRGPEICFPELLPCGYIIGTVNTLYICLQGVGSLAFDSLIPKNIVYRYVSLLSEHRRVILCGPSGTGKSYLAAKLAEFYVQRTQRRGNPVEAVATFNVDRKSCNELRAYLGNIAEQCSGAAAGEEALPAVVVLDNLQHASALGDAFAGLLPPDNRNMPVIIGTMSQATCNTTNLQLHHNFRWLLTANHMEPVKGFLARYLRRKLFSLELRLGRREPALAAVLEWLPGVWSTLNAFLEAHSSSDVTVGPRLFLACPMDLEASQAWFADVWNYSIVPYASEAVREGVALYGRRRHAAVDPLQHVKTTYPWREPNHSHTLRPISVEDAGIEESNQDVTTTNNQDPLLNMLMRLQEAANYSGNQSQDSDNASMDSNLTHDSSMGNEL-