Monarch geneset OGS2.0

DPOGS202693
Transcript: DPOGS202693-TA (3138 bp)
Protein: DPOGS202693-PA (1045 aa)
Genomic position: DPSCF300324 - 56325-68292
RNAseq coverage: 76x (Rank: top 65%)
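The lengths listed above can be cross-checked with simple arithmetic. The sketch below assumes that the transcript is coding sequence only (no UTRs), that it ends in a single stop codon, and that the genomic coordinates are 1-based and inclusive. None of these conventions is stated in the record, and the function names are illustrative.

```python
def cds_length_matches(transcript_bp: int, protein_aa: int) -> bool:
    """Return True if the transcript is exactly the protein's codons plus one stop codon."""
    return transcript_bp == (protein_aa + 1) * 3


def span_length(start: int, end: int) -> int:
    """Length of a 1-based, inclusive coordinate interval (assumed convention)."""
    return end - start + 1


# Values copied from the record above.
transcript_bp = 3138
protein_aa = 1045
locus_start, locus_end = 56325, 68292

print(cds_length_matches(transcript_bp, protein_aa))  # True: (1045 + 1) * 3 == 3138
print(span_length(locus_start, locus_end))            # 11968
```

Under these assumptions the genomic span (11968 bp) is longer than the transcript (3138 bp), which would be consistent with intronic sequence in the gene model.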
Annotation
Heliconius: HMEL010500, 3e-110, 54.10%
Bombyx: BGIBMGA004861-TA, 4e-122, 49.79%
Drosophila: CG6428-PA, 1e-151, 48.03%
EBI UniRef50: UniRef50_Q9W4N6, 2e-149, 48.03%, CG6428 n=27 Tax=Eukaryota RepID=Q9W4N6_DROME
NCBI RefSeq: XP_001992251.1, 4e-153, 47.35%, GH24648 [Drosophila grimshawi]
NCBI nr blastp: gi|383859463, 4e-153, 47.65%, PREDICTED: L-asparaginase-like [Megachile rotundata]
NCBI nr blastx: gi|383859463, 7e-148, 47.65%, PREDICTED: L-asparaginase-like [Megachile rotundata]
Group
Gene Ontology:
    GO:0006520, 2.1e-206, cellular amino acid metabolic process
    GO:0004067, 2e-83, asparaginase activity
    GO:0005515, 3.4e-06, protein binding
KEGG pathway: (none listed)
InterPro domain:
    [444-1045] IPR006034, 2.1e-206, Asparaginase/glutaminase
    [477-837] IPR006033, 2e-83, L-asparaginase, type I
    [844-1044] IPR020683, 8.3e-37, Ankyrin repeat-containing domain
    [337-366] IPR002110, 3.4e-06, Ankyrin repeat
Orthology group: MCL10898, Multiple-copy universal gene
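The InterPro intervals above can be sanity-checked against the protein length. The following is a minimal sketch, assuming the bracketed ranges are 1-based, inclusive residue positions on DPOGS202693-PA. The helper names are hypothetical and the domain list is copied by hand from this record.

```python
import re

PROTEIN_LENGTH = 1045  # aa, from the record

# (interval, InterPro accession) pairs copied from the record.
DOMAINS = [
    ("[444-1045]", "IPR006034"),
    ("[477-837]", "IPR006033"),
    ("[844-1044]", "IPR020683"),
    ("[337-366]", "IPR002110"),
]


def parse_interval(text: str) -> tuple[int, int]:
    """Parse a string such as '[444-1045]' into (start, end)."""
    match = re.fullmatch(r"\[(\d+)-(\d+)\]", text.strip())
    if match is None:
        raise ValueError(f"not an interval: {text!r}")
    start, end = int(match.group(1)), int(match.group(2))
    if start > end:
        raise ValueError(f"start exceeds end: {text!r}")
    return start, end


def domain_length(start: int, end: int) -> int:
    """Number of residues in a 1-based, inclusive interval."""
    return end - start + 1


for interval, accession in DOMAINS:
    start, end = parse_interval(interval)
    within = 1 <= start and end <= PROTEIN_LENGTH
    print(accession, start, end, domain_length(start, end), within)
```

Each printed row gives the accession, the parsed coordinates, the domain length in residues, and whether the interval fits inside the 1045 aa protein.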

Nucleotide sequence:

>DPOGS202693-TA
ATGATAGAATATGAAATTTTACTAGATTCTTCAAACATGGCTTTTGATGACTGGATTCAAATAGCGGAAGATGTCATGAAATATTACGACCAATATGATGGTTTCGTAGTTCTTCACGGTACTGACACACTGTCGTACACGGCGTCGGCCTTATCGTTTATGTTCGAAAATATTGGCAAAAGCATTGTACTCACTGGCTCACAGATTCCAATATTTGAACCGAGGAGTGACGGGTCAGATAACTTGGTGTCATCGTTGCTCATAGCCGGTGGGCTGAACATACCAGAAGTCACCGTGTTTTTTGGAAATAAACTTTATAGAGGAAATAGAACGAGAAAGATATCAGCTAACAACCTCAACGCCTTTAGCTCTCCGAATTGTGTGCCCCTAGTGGAAGTTGGTATCGACTTTGAAGTAAACAAAAAGGCGATATTCAAACCTACCAACAAAGAGAGATGTCACTTACATGCTAAAATGTCCAAAAATGTTGGTCTTCTCAGGATATTTCCAAGTATAAGCACATCTTTGATTAGAGCATTCTTTCAGCCTCCGATTGAAGGCGTAGTATTAGAAAGCTACGGGGCTGGCAACATCCCGTCTAACCGTGAAGATCTGTTAACGGAAATATCAGCAGCTGTTAAGAGAGGAATGATTGTAGTTAACATCACCCAATGCTTAAGAGGAGGAGTTGTGGCGGCCATGTACGAAACGGGAAAGTTTTTATCAGAATGCGGAGTGGTGTCAGGTTACGACATGACGCCGGAGGCTGCTCTCGCCAAACTATCGTACGTGCTGTCCAAAACAGAATTGACGTACCAACAAAAAGTTGATGACGATACACTCTTAGAAGCGTTGGCGTTAACTCTAAACATAGAATCACAAAATAAATTAGTTGAGGTGACAGGAAAGGTATTTAATGCGCTTCTGTTGTATGCGATAGGAAAAAATGATGTTGGCGCCGTCAAGATGATGCTGGACATGGGAGCAAATATTAACACAAAAAACTCTGATGGCAGCACTGTATTACATGAAGCTGTTCTTAAAGGAAATATGCAAATGATTGAGTATTTGTTGAAAAATGGCGCTGAAGTTAATATTTGGACGAGATGTGGGGAGAGCCCTCTCTTAGCAGCTATTCACAAAGATAACGTCGCTGTGATATGTCTACTACAAAAGTACGGAGCATGCCTCAGTAGCGAGGATAGGAAAATTGTGGCAGATATGTCTTCATTGGCAGCCAGGTGTGGAGATCTCAAAAAACTTAAGACCTTGATCGCAGCGGGTTTAGATCTCAGTGCGCCAGATGAGATAGGGCAGAATCTATTGCATAAAAAAATTGAGGCGAAAGTGAAAAAGGCTAGAAGTTTCGTTGAACTGTCTATATTAAACACGTCGAATTCGGTGAAGAGTTTGGGGAGGAATGAGAGACGTGTTTTAGTTTTATACACCGGTGGCACCATCGGTATGGTGAAGAATAAAGATGGAGTTCTGGTGCCACAAAAAGGAGCTTTTGAAAACCTCATTCGAGGGTACCCACAACTCCATGACATTATGTCATGGCGCCAAAGACTAAGTGAACCCAATTTTGACACTTCCTTTCTTGTGTTGCCAGAGGCCAAGGAATTGGATATGAGAATTTCTTATAAAATAATAGAGTACGAAAATTTACTGGACTCTTCGAACATGACTGAGCAAGAATGGATAAGAATAGCAGAAGATATAATGAAACACTATGAAGAGTATGACGGTTTCGTAGTTCTTCACGGTACTGACACATTATCATACACAGCATCTGCCTTATCGTTTATGTTCGAAAATATTGGTAAGGGCATTGTACTCACTGGCTCACAGATTCCAATATTTGAACCGAGGAGTGACGGGTCAGATAACTTGGTGTCATCGTTGCTCATAGCCGGTGGACTGAACATACCAGAAGTCACCGTGTTCTTTGGAAATAAACTCTATAGAGGAAATAGAACAAGAAAGATATCAGTAAACAA
TTTATACGCATTTAACTCCCCGAATTGCGTCCCACTCGTCGAAGTTGGTATCGACTTTGAAGTAAATAAGAAGGCGATATTCAAACCAACAGTCATAGAGAGATGTCACTTACACGCTAAGATGTCCAAAAATGTCGGTCTTCTTAGGATATTCCCTAGCATCAGCGCGTCCGTGGTTAGAACATTCTTTCAGCCGCCAATTGAAGGTGTGGTATTAGAAAGCTACGGTGCTGGCAACATACCATCGAACAGAGAGGATCTATTTTCGGAAATAGCGGCAGCAGTAAAGAGAGGAATGATTGTTGTTAACATAACTCAATGCACCAGAGGCAGTGTCATTTCTCCAATGTATGAGACTGGTAGATTAATATCGGAGTGCGGAGTGGTATCAGGTTACGACATGACGCCAGAGGCTGCCCTCACTAAACTATCTTACGTGCTGTCCAAAACAGAACTCACTTATCAGCAGAAAGTTGACATGATGGTGACGAACATAAGAGGAGAATTAACAAATACGTCATCTATAGCTATCGAGGACAATACTCTTATAGATGCTCTAGCATCAAGTTTGAACATACAGTCACCAAAGAAATTAATAGAGGTGACAGAAAAAGTGTTCAGTGCTCTCTTGTTATATGCAATAGAACATGATGATCTCAGAGCTGTTAAGAAGATGCTGGATATGGGTGCCGACGTTAACGCTCAAAACTCAGAAGGCAAGACGGTGTTGTATGAAGCCATTCTGAGGGGAAACATGCCTATTGTTGAATGTGGCGAAACACCTCTCTTAACGGCTATACATAAGGATGACCACACAATAATATCTCTTCTCCGACAATGTGGAGCGCACCTCGCTAACGTGGACACAAAACCAGTTGCTGAAATGCTATCACTAGCAGCCAGGAGTGGTGTAGTTCACAAACTGGAGAGCTTGAGAGCCGCGGGTTCGGAACTCAATTTACCAGACGAGATCGGTCAAACACCCTTGCATAAGGCCGTTCTCTGTAATAATCCAGCGGTGGTCAGATATCTGTTATCGCAGGGAGTGGACAAAGAAACCAAAGACATCCTTGGATTCACTCCGATGGATTGCGCGATAAAGTTGGAACTCACCAATATTATTGATATGTTGAAATAA

Protein sequence:

>DPOGS202693-PA
MIEYEILLDSSNMAFDDWIQIAEDVMKYYDQYDGFVVLHGTDTLSYTASALSFMFENIGKSIVLTGSQIPIFEPRSDGSDNLVSSLLIAGGLNIPEVTVFFGNKLYRGNRTRKISANNLNAFSSPNCVPLVEVGIDFEVNKKAIFKPTNKERCHLHAKMSKNVGLLRIFPSISTSLIRAFFQPPIEGVVLESYGAGNIPSNREDLLTEISAAVKRGMIVVNITQCLRGGVVAAMYETGKFLSECGVVSGYDMTPEAALAKLSYVLSKTELTYQQKVDDDTLLEALALTLNIESQNKLVEVTGKVFNALLLYAIGKNDVGAVKMMLDMGANINTKNSDGSTVLHEAVLKGNMQMIEYLLKNGAEVNIWTRCGESPLLAAIHKDNVAVICLLQKYGACLSSEDRKIVADMSSLAARCGDLKKLKTLIAAGLDLSAPDEIGQNLLHKKIEAKVKKARSFVELSILNTSNSVKSLGRNERRVLVLYTGGTIGMVKNKDGVLVPQKGAFENLIRGYPQLHDIMSWRQRLSEPNFDTSFLVLPEAKELDMRISYKIIEYENLLDSSNMTEQEWIRIAEDIMKHYEEYDGFVVLHGTDTLSYTASALSFMFENIGKGIVLTGSQIPIFEPRSDGSDNLVSSLLIAGGLNIPEVTVFFGNKLYRGNRTRKISVNNLYAFNSPNCVPLVEVGIDFEVNKKAIFKPTVIERCHLHAKMSKNVGLLRIFPSISASVVRTFFQPPIEGVVLESYGAGNIPSNREDLFSEIAAAVKRGMIVVNITQCTRGSVISPMYETGRLISECGVVSGYDMTPEAALTKLSYVLSKTELTYQQKVDMMVTNIRGELTNTSSIAIEDNTLIDALASSLNIQSPKKLIEVTEKVFSALLLYAIEHDDLRAVKKMLDMGADVNAQNSEGKTVLYEAILRGNMPIVECGETPLLTAIHKDDHTIISLLRQCGAHLANVDTKPVAEMLSLAARSGVVHKLESLRAAGSELNLPDEIGQTPLHKAVLCNNPAVVRYLLSQGVDKETKDILGFTPMDCAIKLELTNIIDMLK-
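
The protein record can be checked against the transcript by translating the nucleotide sequence. The sketch below assumes the standard genetic code and reading frame 1, and it writes a stop codon as "-" to match the trailing "-" in the protein record above. The `translate` function is an illustrative helper, not part of any database tool. Whitespace is removed first, so a sequence wrapped over several lines is handled.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code in TCAG order; '*' marks stop codons.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(nucleotides: str, stop_symbol: str = "-") -> str:
    """Translate a coding sequence in frame 1; unknown codons become 'X'."""
    seq = "".join(nucleotides.split()).upper()
    protein = []
    # Step through complete codons only; a trailing partial codon is ignored.
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        protein.append(stop_symbol if aa == "*" else aa)
    return "".join(protein)


# First six and last four codons of DPOGS202693-TA, copied from the record.
print(translate("ATGATAGAATATGAAATT"))  # MIEYEI
print(translate("ATGTTGAAATAA"))        # MLK-
```

Applied to the full 3138 bp sequence, the result can be compared directly with the DPOGS202693-PA string.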