Monarch geneset OGS2.0

DPOGS206945
TranscriptDPOGS206945-TA3249 bp
ProteinDPOGS206945-PA1082 aa
Genomic positionDPSCF300001 - 402219-411662
RNAseq coverage176x (Rank: top 50%)
Annotation
HeliconiusHMEL0021232e-14641.68% 
BombyxBGIBMGA012941-TA5e-12644.84% 
DrosophilaCG4830-PA5e-5428.33% 
EBI UniRef50UniRef50_D6WF943e-10127.68%Putative uncharacterized protein n=2 Tax=Tribolium castaneum RepID=D6WF94_TRICA
NCBI RefSeqXP_973874.24e-7232.18%PREDICTED: similar to CG6178 CG6178-PA [Tribolium castaneum]
NCBI nr blastpgi|2700038431e-10027.68%hypothetical protein TcasGA2_TC003124 [Tribolium castaneum]
NCBI nr blastxgi|2700038432e-9927.61%hypothetical protein TcasGA2_TC003124 [Tribolium castaneum]
Group
Gene OntologyGO:00081521.5e-79metabolic process
GO:00038241.5e-79catalytic activity
KEGG pathway 
InterPro domain[62-471] IPR0008731.5e-79AMP-dependent synthetase/ligase
Orthology groupMCL10359 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS206945-TA
ATGCAGCAGTCTTTTGATTCATGCAATTATTACTTCAATGAAATATCTAATAAGGTCACAGCTGAATCTGGAATATGGACCGACGGAGAGCATCTTGGAAAAATAATCATACGATGTCTCAAAGAAGCACCGAATTTTATAGCACAGATAGATGGGGGAACAGGTGAGAAGGAAACAAATAAGTCCGTTTTGGAGAGGACTGTAATGTGCGCTCAGAGTTTCATAAATTTTGGCCTGAAATACCAAGATGTTGTAATGGTCATTGCACCGAACCATCTGCACATCAGTATTCCACTGTACGCCGCGTTCTGTACTGGTGTAATTTTCGCTGGAATAGACTTTAACTTGGGAGAGAATGAGTTAGCGGACACATTTAAATCGGGCCAGCCAAAAATGATATTCTGTCAGAACTCGAATCTGCAAACAGTCCGTAAAGCACTTGCAAGAATAAAAAGTAATGCTGAGATAGTTACGTTCGATGAAGGGCAAGACTGTATAAGTTTTACGAAATTCATTTCTAAATACAGTGGGGATGCTACTGTTGAAAATTTCAGGATTTGCGATTTTGAACCAGTTGAAACCATAGCATTGTTAATCGCTACAAGCGGTTCCACAGGTTTACCTAAAGTGGCTGTACTAACTCACCAGAACGTTAGCGTTGGCTTCATACAAAATTGGAAAGGTTTATCAAAGGCCCCAAATCCATTCGATATAGGTTTGGTGATATCTCCAATTCAATGGATATCTTCAACTTTCCAGATAGTAATGTCACCAATTTTGAGATACACCAGATTACAAACATCAAATAAACTGTCCCCTGAACACGTTTATGACTTAATTAATAAATATAAGCCAAAATATACCATCTGTAGTCCCACATACATGACAACTTTACTTAGAAACGATCATCAGCATGTATGTGATTTTACATCATTCAAATATATTCTAATTGGTGGAAGTGCTGTGTCAAAAGAGCTTTACGCAGACCTAAAGAAAGTAGCTCCAAATGTAATGATACAAGTTGGTTACGGTATGAGTGAGGCATCCGGATTAATATTTTCACCACATTACGTACCTCTGGGTTCAATTGGAAGACCCATGGAACATGTCAATTGGAAACTCGTGGATCCTGATACTGAAGAAATAATTCCTGAACCATATAAGGCCGGAGAGATACGAATAAAAGGGAGATCTATATTTAAGGGTTATTACAACAATCCCGAAATGACCGCACAGGCTTTTGACAAGGATGGCTGGTTGAAGTCAGGAGATATTGTATATAGAGATGAAAACTACAATTTCTTTTACGTGGATCGTCAAAAGTTGCTGCTTAAATACAGGAATCATCAGGTATCACCGTTAGAAATAGAAAATGTTATATTAAAACACCCGGGAGTTGTGGATGTGGCGGTATCAGGTATACCAGACCCTGAATATGGTGACCTTCCAATAGCTTTTGTGGTGAAGAAGAATGATTACGATCTCACCGCGCAATGTGTCGAAGATTTGGTCAAAGAAACACTAACGGACTCAAAACAATTGAGGGGAGGCGTTATTTTTCTGGACGAGCTCCCTGTGACATCAACATCAAAGCTAGACCGAACGAAATTAAAGAATATGGCAGTCAACATGGCAAAATGGGTAAGAAGCAGGAATGCAGTAAACATGCACCTGGAAGAACTATCTTCAAGAATAGTGGCTGATTCTGGTATACCAACTGATAGATATCATTTAGGAAAACTGATATTGCAGAGCCTTAAAGATGCTCCCGATTATCTGTCACAGATTGACGGGGCCTCTGGAGAGACTGAAAATTTCGAATCGGTTCTGAGACGATCTGTTCGATGTGCTACAGCATTAAAGAATTTAGGGCTAAAACAGGGAGATGTGGTGGTTTTGATGGCACCGAACCACATTCATCTATGTATACCCATTTACGCTGCATTGTACATTGGAGCAATTGTTGCAGGAATTGACATGAACTTAAAAATCAATGAACTTAAGGATAGTTTCAAAATAAACAAGCCGAGCGTAATATTTTGCCAGAGCGAAAAAGCCGCTGATATTAATTTGGCTTTAAGCAATTTGAACATCGATCCTAAAATAGTAACATTTGACAAAGGGAGCGACTATTTGAATTTTCATCAATTTGTTGATAAATATGGCGACGATACTCCTGTCGAAGAATTCAAAGCTACCAATCTGGATCCAAATGAGGCGATAGCTTTACTGATCTCTACAAGCGGAACTACAGGCTTACCAAAATCGGCTGCTGCTACACATGCAAACTTTGCAATATCAGCTGCTAACATGTGGGTCCTCTTTGATACTTGTCCGTCCCCAACTCGCCTATCCGTTATAATGTCACCCCTACAATGGTACTCAGCTTTATTCCAATATATATATACGCCAATAGTGAGGACAACGCGTCTCCAGTCCTCTTTACCAATGACACAGGAACACGCATACTACATTATAAATAAATATAAGCCGACATTTACAATGTGCAGTCCGAATATGTGGGCTGAGCTCTTCAAGAAGGGAGATCGTGACAAATGCGATTTAAGTTGTTTCGATCTTATTATGGCCGCCGGCAGCGATGTACCATCCACACTCTTCGATACTATAAACTCGGTCGTCCCAGAGACATGTTTCATACCAGCGTATGGTTTGAGCGAGATATCGGGAATCGCATTTGTTTACGACAGCACAAATCCAAGGTCGTTGGTGTCACCGGAAACAAAATTAGATGTTACGGAACCGAACGTTCCTGGAGAATTATTTATAAAAGGACCAGCCGTTTTTAAAGGCTATTACAACGATGAAAAGTGTACAGAGGAGACCTTTACAGATGATGGTTGGTTCAAAACTGGTGATATATTTAAGAGGGACGAGAATTGGTATTTCTACTTTGTGGAACGAAGAAAGATGTTGCTGATACATAAAAATTACCAGGTTTCCCCTTTGGAAATAGAGAATGTAATTATTCAACACCCAGCGGTATACCAAGTTGCGGTAACCAGTGTTCCACATCCTGAACATGGAGATCTGCCCGTGGCTTGCGTAGTTAAACATAAGGACAGTACTGTAACTGCCCAGGATATTAAGGATATGGTCGAAGAAACATTATCGGAACAAAAGCATTTGTCTGGAGGAGTGATATTTTTGGATGCACTACCAATGACCTCAACATCCAAAGTAAATAAGTCCAAACTTGCGGCTTTGGCTCGAGTTTCGGAACGACTGTAG

Protein sequence:

>DPOGS206945-PA
MQQSFDSCNYYFNEISNKVTAESGIWTDGEHLGKIIIRCLKEAPNFIAQIDGGTGEKETNKSVLERTVMCAQSFINFGLKYQDVVMVIAPNHLHISIPLYAAFCTGVIFAGIDFNLGENELADTFKSGQPKMIFCQNSNLQTVRKALARIKSNAEIVTFDEGQDCISFTKFISKYSGDATVENFRICDFEPVETIALLIATSGSTGLPKVAVLTHQNVSVGFIQNWKGLSKAPNPFDIGLVISPIQWISSTFQIVMSPILRYTRLQTSNKLSPEHVYDLINKYKPKYTICSPTYMTTLLRNDHQHVCDFTSFKYILIGGSAVSKELYADLKKVAPNVMIQVGYGMSEASGLIFSPHYVPLGSIGRPMEHVNWKLVDPDTEEIIPEPYKAGEIRIKGRSIFKGYYNNPEMTAQAFDKDGWLKSGDIVYRDENYNFFYVDRQKLLLKYRNHQVSPLEIENVILKHPGVVDVAVSGIPDPEYGDLPIAFVVKKNDYDLTAQCVEDLVKETLTDSKQLRGGVIFLDELPVTSTSKLDRTKLKNMAVNMAKWVRSRNAVNMHLEELSSRIVADSGIPTDRYHLGKLILQSLKDAPDYLSQIDGASGETENFESVLRRSVRCATALKNLGLKQGDVVVLMAPNHIHLCIPIYAALYIGAIVAGIDMNLKINELKDSFKINKPSVIFCQSEKAADINLALSNLNIDPKIVTFDKGSDYLNFHQFVDKYGDDTPVEEFKATNLDPNEAIALLISTSGTTGLPKSAAATHANFAISAANMWVLFDTCPSPTRLSVIMSPLQWYSALFQYIYTPIVRTTRLQSSLPMTQEHAYYIINKYKPTFTMCSPNMWAELFKKGDRDKCDLSCFDLIMAAGSDVPSTLFDTINSVVPETCFIPAYGLSEISGIAFVYDSTNPRSLVSPETKLDVTEPNVPGELFIKGPAVFKGYYNDEKCTEETFTDDGWFKTGDIFKRDENWYFYFVERRKMLLIHKNYQVSPLEIENVIIQHPAVYQVAVTSVPHPEHGDLPVACVVKHKDSTVTAQDIKDMVEETLSEQKHLSGGVIFLDALPMTSTSKVNKSKLAALARVSERL-