Monarch geneset OGS2.0

DPOGS209766
TranscriptDPOGS209766-TA5283 bp
ProteinDPOGS209766-PA1760 aa
Genomic positionDPSCF300314 + 104098-120304
RNAseq coverage62x (Rank: top 68%)
Annotation
HeliconiusHMEL0119140.064.21% 
BombyxBGIBMGA005406-TA0.074.12% 
DrosophilaCG42284-PC2e-9146.97% 
EBI UniRef50UniRef50_E2BRG50.046.04%Arginine kinase n=3 Tax=Endopterygota RepID=E2BRG5_HARSA
NCBI RefSeqXP_971656.20.049.91%PREDICTED: similar to CG30264 CG30264-PA [Tribolium castaneum]
NCBI nr blastpgi|1892414300.049.91%PREDICTED: similar to CG30264 CG30264-PA [Tribolium castaneum]
NCBI nr blastxgi|3287841300.040.17%PREDICTED: hypothetical protein LOC551249 isoform 2 [Apis mellifera]
Group
Gene OntologyGO:00163011.5e-92kinase activity
GO:00167721.5e-92transferase activity, transferring phosphorus-containing groups
GO:00038249.4e-41catalytic activity
KEGG pathwaytbr:Tb09.160.45905e-28 
 K00934 (E2.7.3.3)maps-> Arginine and proline metabolism
InterPro domain[941-1379] IPR0007491.5e-92ATP:guanido phosphotransferase
[1101-1357] IPR0224148.1e-45ATP:guanido phosphotransferase, catalytic domain
[1099-1357] IPR0147469.4e-41Glutamine synthetase/guanido kinase, catalytic domain
[280-362] IPR0206835.5e-32Ankyrin repeat-containing domain
[957-1068] IPR0224132.7e-16ATP:guanido phosphotransferase, N-terminal
[1389-1426] IPR0078581.3e-11Dpy-30 motif
Orthology groupMCL10412 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS209766-TA
ATGAAATTAATGCAAACGACGGCTAATGAAATTAAAATAACCTTCCAAGCGATAGCAGCAGTGACACAAACTAAACTTCGTCAATGGGCCCTTAGCGGACCGAGTTGCAAGCTGGAGCGAGCTATACTCGCGGGACACGGCCGCCGCTTGCTAGAAGAGGCGGAGGCACTGCCATTATCAAAATACCATATAAATTTGATAGCAAAATGTAATGCATTGCATGAAGCTACTAGCAAAGGATCTCTGTTAGAGTTACAGACACTTTTGGAAGGTGAATATAACCACCAAAAGTACGTGATGTGTTACGACGAGGCTGGTGTTGGCCTTCTACACAAGGCTGTCTTCTACAACTTCATAGATATAGTTGAGTGGCTTGTTAAAAACTATCCTCAATTGGTGCACCAGAGAGATTCGTTGGGTCGTACGCCGTTGCACTATACCGCTGCGAGCCGTTCCGAAGGCTCAGCTAGTTCATTGCTTGAGTCGGCGGGGGCGGACAGGGGCGTAAGGGACGCCGCCGGACATACTATAGCTCACTACAGAGCCAACCTCACTCAACTAGCGCTACCCGACATGCCAACCCGACCTAACCCGCCCGGACTGGTGATAAAGCGTCATAACATCCGCATATGGTGCCATGAGTGTGATATGGGAAAGCTGCAACGTGTGGTTTGGGAGGGGCAGGGAGCGCGGCTCCTCAGTGAGGTCTCCAGTCAGCCAGTCGTTAAGAAGTTTCTAGAAGCTGTGCCCTATATTATGAACACTATACGCGACATCCACAATGCAGTAATTCAAAACGACTTAGAGGGACTCATAAAATTAACAGGAGAACCCGTACCGCCTCACGCCCTCTCCTGCAGAGACAGCAATAACATGACTCCGATGCATAAAGCAGCAGGGTTAGGTCACGGCGGAATTTTAAAGTACATCATTGAAAGGTATCCACAAGGTATAAAGGATGTTGACAACGATGGACGAACTCCTCTACATTATGCAGCCCTTGTCAAAGACGATCAGCATACATATAATACTCTACTAGGATCTGGAGCAGACGAAAGTGCTGTCGACAACAAAAATAAAACACCTGGTTATTATATAAATAAGAGTCATGATATAGACAAAAATATTTTCAAAACATTGCCTGACGCACCGAGAACACCTTCTAACTCGTATCCTCCTTCCTGGGATTGGAAAATTTTGGAAAGTGATTACACAGCGGATATCAATAAAAAGTTTAAAAAGAAAAACCATAGAACTTCTAATGAAAATATATCTTCAAAAAATAATACGAACACAATTTCTGAGAGCATTGAAACTAATAACGGAGTGATGAAGAATAGCAGCACACAAGAACTAATTTCTACTTTTCCTGATTTAAATGAAAACGAAAAGGCAGAACCGGAAACTACCGAAACTAATGAAAATACTACACATATCCAGGAAGATACACGTATTGATGAGGATAATGAAATTAAAAAAGAGGTCAGTAATAATGAAGAGAGCTCCGAGACTGAAAATCAAGACGAGCAAGACCAAAATAACTCAAAAATTATAGAAAATGATGACGGTATTAATGAAGAAAACCTTAATGATGAAGATCACAATAAAGAAGGAGAAATCAATGACGTTGTTCTTAATAGCAGTTCTGACGTTAATATTACTAATGAAAAGAGTGAACAAGATCAGAAAGAAAATAATATCGTAGATTTTAATAGTACACAAAAAGAAGTTGTTGAAGATACTCAAAATATAGATGATGCAGATGATGAAAATATTAAAGACCATCATAATATACAAGAAGAGAATGATAAAAATAAAAACATAGAAGATATTGATGATAAAGATGATAAAGATGTAAATGTGGATATGGATAATAAAGACAAGGAGGAAGTGGTCGAAAATAATGACATTCCTGATAGCCTAGAACTAAATATGAATGAATCTAAAGTAAATAATGACTTAGAAGAAAATGTATTAGACAATGATAGTCAAGAAGACAAATCTGATAACCATAATGAAGATATATCAAAAGAAACGAACGATATTATCAATGAAGACTTAATAGTTGGAAGGGCTTCATCAAATACGTCAAATAAAGAAGGAAACCCTGTCAGCGGCCGTAATATTCAGGAAAGTCTCATAGAAGGATTAATTAGTAGCGAAGCAGAACAGGAAACAGAAGACGCGAGTATATCACAAAGAGAGAGCTCAGTTCACAACGATATTTTAATTGTAGACAATGAAATAGATCCTGAAGTTACCGATTTAATTAATACTGCAAACATGGAAATGTTAGCTACATTAGTATTGAACGGAGAAGGAGCCAGATTGATCGGAAGACATTCTGGAAATTTAGAATTGCAAGCTTTTTTAAATAATGTCCCAACGTATATGCAAAAGATAAATAGAGTACATATTGCAGCTAGGGAAGGGAATATAAGAGATTTACAAGCAGCTCTAGATCGTAGAAAATTTGCTATTGCCCGAGATCCTATATCACCGAACGGAGCAACTCCTTTGCATGTAGCAGTTGTTTTTGGTAAAACGAATATTATAAAATATCTAGGAGGGAGGTTTCCTGAAACATTGTCTGCTGTTGATTTTGAAGGCCGTACAGCTTTACATTATGCGGCTGTTTTACCAGATAATGGTCATTATTTTAATTTGTTACAACAGCTGGGAGCAAATTCTAAAGATTTAGATGATATGGGACGATCTGCTGAGGATTATTACAAAAATCCTACTCTGTTGACTTTCAAACAATTGTTAGCGGACTTTGGAGTAAGTGAAGAAGTAGCTCAAGAAATGTTTACAGATAAAGTGCCAGATGATCGCGTTTCTTCCCGTCGAGTTTTAGATAACTCTGAAGCCTTGGATACTTTAGAACGATGTTATAGATTGCTAGCATCTGCAAGACCAACAAGAACACCTCTGTCTGCGTCTTCTAATAAAGCCACTCCTCCAAATGTTTTAGGAAGATTCCTTAAACGACAAATATTTCTTATGATAAAGGATAGAGTTACAAAACTAGACCACAACCTCTTTGACGTTATCTGGCCAGCTGTAAAAAAGCTACCTGATAGCAGAAATATTATACAAACCGTGGAAGAGGATTTTCCTGGTGGAGTCACTGCTCCAGATTATTATGTTTATGAAGTATTTTCAGAATTTTTAACGCCACTCATTAAAGATCTTCACAATATGAATGTTAATTCTGAGCTACCAGAACATCCAATATCAGATTTTGTTAAAAATATACCATTAAAGGAAGTCACTGAAGCTCCGATAGAAATAAATGTAGATCCTGGAGAAGATTTCGTACTGTCAGGAACAATGGAATGCTCAAGAAACCTCGATGGATTTGAATTACCGTTGAACTTAAAAATTGGCAAACTCGAAACCATTGAAAGAATTATAACTACAATTTTGATGAGACCAGAGTTCTCAAAAATTATAGAACAATTCAATCCCGAGTCCGACCAAAGATGGGGAACTTATTACACTTTGAATGAGGTGTTAGAGAAGCCTTCCGAAATTAGTGCAACGCTAGCAGCGTCTGGATTACTTATCCCAATTTGTGATAGAGAAGAAATTGATGATAGCACTCATCTTCATGGTCAGCATTGGCCGTATGGGCGTGGGGTTTTTTTAAGTGACGATAAAACTGTGGCCGTTTGGATCAATGTTCATGATCATTTACGTGTACTAATTTCAACTCCCCTTGATTCACCGGGGGAAATTGGATTAACATTCAGCACATTGACTCGCATAATGTCATATTTACAGAATAACCTAGATTTCGTTTGGGATCATAAATTAGGTCATTTGTCCAGCAGACCATCTTTTTTAGGGGCAGGCATAAGATTTAGCCTTATTGTAAACTTTCCCGGTTTGGCAAAAGATACTGATAATATGAAACATTTATGTGCTATGCGAGGTTTGCAATACAGAGAAACCTTAAGTCCTGATATAGCAAGAATAAGCAATTATCAGTGTTTAGGAATAACAGAATCAAATTGCTTTAAAGATTTTGCTACGGCGACTTCAAATCTAATTCATCTAGAAAAAGATTTTCTGATGCAAAATTCTGCTCATATAGCAACATTGTTAAATATAAGCACCACGGCTAACGGCCACATGGACATTCCCATATTTCAAACTGAAGAAGGCAGATATCTCGCAAAGTCACTCGGCGATCCTTTGATTAAAGGTCTGACCGAAGTGGCTAATATCAAACCTAAAGATCCTGTAGCATTTCTAGCGACTTTCCTACACAACTTTCCTGAACATGAGAAGCCTCGACCTGAAACACAGGAATCAAATGTATTAGTGACAAAAGAAGCTGCTGAATTTGAAAACGAACATCCGCCATACGCAGACGAGATGGAAGATGATAATGAGGATCCGTCAGTAATAAGAGAACAAGATCGGTATCGCACCGCAACAAAACCACGCCCCATCGACGTTATAACTGTAGATCCTCAAACCGAGACCAGTCCGGATGCACCGGAAATCGCTAGCAGTAGTGCGAATAGGGATGAACACGGTCAATCAATGTTACATTTTGCTGCTGCTCGAACACATACAAACAATGCTCTGTTTCAACTATTGCAAGAATCTGAAGTGAGCCTCGGTTATAGAGACGAGCTGTATCGGACAGCGCGCGACGTCTCCATACAAGCGAACGTATTAGAAAATACTGTGGAGATAGACAGATGGGTACTTTCATTGGCTGCGAGAGGGAAGACAGATAAAATCATGGAATTGCTTATTCAAGGCTACGATCATATATTGGATATTGTGGATGAAGAAGGAGTGCCCATGTTAGAAGTAGTTGGCCAACGTGGTGACGATAGCATGAACAATTTACTCGCCTCCATTCCACCCTTCGAGGAATCTCGGGAATCTCTCCATGGAGCAGTTCGTCGTGGTGATATGAACGCTGTCCGTGAAATTATATCTGGTGATCGTGGACATACCCTAGCACGTGCTCACAACGCACTCGGTCGCACTTCTCTCCACGTGGCTGTGTTGGCACAACATGAAGATGTTGTAGACTACCTTTCAGAGACATGCCCTGAGCTACTGCGGGTCGGGGACAATTTGGAAAGAACGCCATTACACTATGCAATGGGAATGGAGAAAATGGAATCTCTAAGTCGTATTCTTATTAAAGCGGGCGCTAAGCGCGTTCTTAAGGATCTCAAGGGCCGTCAGCCTTCTTATTATTTCATGAACAAGTCAGATATACTACGGTTGAAAGAAGAAGAAGAAGCTTACTAA

Protein sequence:

>DPOGS209766-PA
MKLMQTTANEIKITFQAIAAVTQTKLRQWALSGPSCKLERAILAGHGRRLLEEAEALPLSKYHINLIAKCNALHEATSKGSLLELQTLLEGEYNHQKYVMCYDEAGVGLLHKAVFYNFIDIVEWLVKNYPQLVHQRDSLGRTPLHYTAASRSEGSASSLLESAGADRGVRDAAGHTIAHYRANLTQLALPDMPTRPNPPGLVIKRHNIRIWCHECDMGKLQRVVWEGQGARLLSEVSSQPVVKKFLEAVPYIMNTIRDIHNAVIQNDLEGLIKLTGEPVPPHALSCRDSNNMTPMHKAAGLGHGGILKYIIERYPQGIKDVDNDGRTPLHYAALVKDDQHTYNTLLGSGADESAVDNKNKTPGYYINKSHDIDKNIFKTLPDAPRTPSNSYPPSWDWKILESDYTADINKKFKKKNHRTSNENISSKNNTNTISESIETNNGVMKNSSTQELISTFPDLNENEKAEPETTETNENTTHIQEDTRIDEDNEIKKEVSNNEESSETENQDEQDQNNSKIIENDDGINEENLNDEDHNKEGEINDVVLNSSSDVNITNEKSEQDQKENNIVDFNSTQKEVVEDTQNIDDADDENIKDHHNIQEENDKNKNIEDIDDKDDKDVNVDMDNKDKEEVVENNDIPDSLELNMNESKVNNDLEENVLDNDSQEDKSDNHNEDISKETNDIINEDLIVGRASSNTSNKEGNPVSGRNIQESLIEGLISSEAEQETEDASISQRESSVHNDILIVDNEIDPEVTDLINTANMEMLATLVLNGEGARLIGRHSGNLELQAFLNNVPTYMQKINRVHIAAREGNIRDLQAALDRRKFAIARDPISPNGATPLHVAVVFGKTNIIKYLGGRFPETLSAVDFEGRTALHYAAVLPDNGHYFNLLQQLGANSKDLDDMGRSAEDYYKNPTLLTFKQLLADFGVSEEVAQEMFTDKVPDDRVSSRRVLDNSEALDTLERCYRLLASARPTRTPLSASSNKATPPNVLGRFLKRQIFLMIKDRVTKLDHNLFDVIWPAVKKLPDSRNIIQTVEEDFPGGVTAPDYYVYEVFSEFLTPLIKDLHNMNVNSELPEHPISDFVKNIPLKEVTEAPIEINVDPGEDFVLSGTMECSRNLDGFELPLNLKIGKLETIERIITTILMRPEFSKIIEQFNPESDQRWGTYYTLNEVLEKPSEISATLAASGLLIPICDREEIDDSTHLHGQHWPYGRGVFLSDDKTVAVWINVHDHLRVLISTPLDSPGEIGLTFSTLTRIMSYLQNNLDFVWDHKLGHLSSRPSFLGAGIRFSLIVNFPGLAKDTDNMKHLCAMRGLQYRETLSPDIARISNYQCLGITESNCFKDFATATSNLIHLEKDFLMQNSAHIATLLNISTTANGHMDIPIFQTEEGRYLAKSLGDPLIKGLTEVANIKPKDPVAFLATFLHNFPEHEKPRPETQESNVLVTKEAAEFENEHPPYADEMEDDNEDPSVIREQDRYRTATKPRPIDVITVDPQTETSPDAPEIASSSANRDEHGQSMLHFAAARTHTNNALFQLLQESEVSLGYRDELYRTARDVSIQANVLENTVEIDRWVLSLAARGKTDKIMELLIQGYDHILDIVDEEGVPMLEVVGQRGDDSMNNLLASIPPFEESRESLHGAVRRGDMNAVREIISGDRGHTLARAHNALGRTSLHVAVLAQHEDVVDYLSETCPELLRVGDNLERTPLHYAMGMEKMESLSRILIKAGAKRVLKDLKGRQPSYYFMNKSDILRLKEEEEAY-