Monarch geneset OGS2.0

DPOGS209498
TranscriptDPOGS209498-TA2970 bp
ProteinDPOGS209498-PA989 aa
Genomic positionDPSCF300127 - 122701-130588
RNAseq coverage38x (Rank: top 73%)
Annotation
HeliconiusHMEL0160224e-14082.58% 
BombyxBGIBMGA007341-TA3e-14667.40% 
Drosophila% 
EBI UniRef50UniRef50_B0W0123e-3934.18%Putative uncharacterized protein n=4 Tax=Culicidae RepID=B0W012_CULQU
NCBI RefSeqXP_001842046.15e-4034.18%conserved hypothetical protein [Culex quinquefasciatus]
NCBI nr blastpgi|1700283251e-3834.18%conserved hypothetical protein [Culex quinquefasciatus]
NCBI nr blastxgi|1700283251e-4834.10%conserved hypothetical protein [Culex quinquefasciatus]
Group
KEGG pathway 
InterPro domain[377-486] IPR0124622.4e-20Peptidase C78, ubiquitin fold modifier-specific peptidase 1/ 2
Orthology groupMCL14646 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS209498-TA
ATGTCGAGCAGCCCTTTCCCATACACGTGTGAGTTATGCGGGGCTGAAGGCCTCACAGACGAGGGTATGCGGTCGCATACTCTGGAGGCCCACGTGGCCGGAAGACCAGACTGCCCCTTCTGTGACTGCACCGTCCCGCAGCCACAGCTCGTAGGACATGTACAACGAGCACACCTACATTACCTGACGCCGGAAAGAGAACTCATGGCGTTCATTGATGATCAGAGCCCAAGTTTCGAAGAAGATTCCAAAATGACGACAACAGATAGCTGCAGTTACAACACGCCGGGCTCTATGAACGGTTGGCACAGCCCCGAGGCGGCGTCCTACCACAACGGCGCCATTTCAAAAAACTACTACAACGGCTTCCAAGACAAAGATAATTATAGAGAAAAAGATGACGATAAATACTCTCGTTCGCCCAAAAATATAAACCTTACCAACGGCATGAAAAGCATGAACATTAATAATACAGCGAAAAAGAAATGCAGCAGAGAGAACTCCATCGACCGCGATTACATTAATGGACACGATAAAAAAGCTGCACATACTAACTCAAACCATAACAGTAACGATAGCTCACCAAACAAAAATAAACTAACAATGGCAAGTGCAGGTCAAGGGTCGCCTCTTAGGTCACAACTGGCACTTAAACTGAAGTCCAATACACCTAAAAAGAATGCGCCAACGCCTAGCCCAACAGTGCAGTGTCTTCTATGTGACTTTAAGTCGACATGTCCAAGAAAACTCGAAGAGCATATAAACCGGGCTCATTTTGATTTAACTTCTCCCTCAGTGTTGGGAAATGCCAATGATAACTCCAATATTACTAACAACGCCACACTAAGTCTTAGTAACGCCACCATAACCCTGGATAATCCAACCCTGGCGTTAAGTGCTATGTCAATATCACCAGGACCGCATTCCTCGAGCTACCAATGTCCTATATGCGAAGTCGAATTTTCCAATGGGTCGGAAGTCGAGGTTCACGTCAATGTTGAGCACAGGGATATCTTAAGCCCACAGAAATCTGATCAAGCAGACAATGCCTTGTGTGATGATGTAGTTATGATGGAAGAGAGTCCTGTCAGTAACTGTCCTGTCTGCTGTCAACCATTGCCACTGTCACAACATGACTGTCAGTTGATAGACTTCCATAAACCGACGGCGGCTGACGGCTCCCATCCTGCGCTCTTCGACTATGTCCTGAGATACTTCACACACGATCCAAACGCTTTCAAACCGCCGTTATACCTTCAACATCAAGGTCACTCCAGAACAATCATTGGCTACGAAAAACACAAGGACGGTAAGGCGACACTGCTAGTTTTGGACCCGTCGCATTCCCCGGCACAGGTGCGACAGGTGTCTGTGGGGTCGTGGTCGTCAGCGGCGAGCGCGCTCCGCCTGCTGAGACGAGGAGCCCCTGCGCTGCGAGCGAGGCAATACCAGCTGCTGTGTGTGGACGGACTCATTACCACCGACCAGGAGTACCAGGTGATGTACCACCTACCCCCTACATGTACCACCTACCACTTCCATAGACAACATATAGATGGTCTTATTACCATTGACCAGGGGTACCAGGTTATGTACCATCTACCCCCTACATGTACCACCTACCACTTCCATAGACAACATATAGATGGTCTTATTACCATTGACCAGGGGTACCAGGTTATGTACCATCTACCCCCTACATGTACCACCTACCACTTCCATAGACAACATATAGATGGTCTTATTACCATTGACCAGGGGTACCAGGTTATGTACCATCTACCCCCTACATGTACCACCTACCACTTCCATAGACAACATATAGATGGTCTTATTACCATTGACCAGGGGTACCAGGTTATGTACCATCTACCCCCTACATGTACCACCTACCACTTCCATAGACAACATATAGATGGTCTTATTACCATTGACCAGGGGTACCAGGTTATGTACCATCTACCCCCTACATGTACCACCTACCACTTCCATAGACAACATATAGATGGTCTTATTACCATTGACCAGGGGTACCAGGTTATGTACCATCTACCCCCTACATGTACCACCTACCACTTCCATAGACAACATATAGATGGTCTTATTACCATTGACCAGGGGTACCAGGTTATGTACCATCTACCCCCTACATGTACCACCTACCACTTCCATAGACAACATATAGATGGTCTTATTACCATTGACCAGGGGTACCAGGTTATGTACCATCTACCCCCTACATGTACCACCTACCACTTCCATAGACAACATATAGATGGTCTTATTACCATTGACCAGGGGTACCAGGTTATGTACCATCTACCCCCTACATGTACCACCTACCACTTCCATAGACAACATATAGATGGTCTTATTACCATTGACCAGGGGTACCAGGTTATGTACCATCTACCCCCTACATGTACCACCTACCACTTCCATAGACAACATATAGATGGTCTTATTACCATTGACCAGGGGTACCAGGTTATGTACCATCTACCCCCTACATGTACCACCTACCACTTCCATAGACAACATATAGATGGTCTTATTACCATTGACCAGGGGTACCAGGTTATGTACCATCTACCCCCTACATGTACCACCTACCACTTCCATAGACAACATATAGATGGTCTTATTACCATTGACCAGGGGTACCAGGTTATGTACCATCTACCCCCTACATGTACCACCTACCACTTCCATAGACAACATATAGATGGTCTTATTACCATTGACCAGGGGTACCAGGTTATGTACCATCTACCCCCTACATTGAAACGACCAATGGTTATTCGTCTAGGATTGTTGGTAGATTTGGAAACGACTTGGAAAATTGTGTCACATTTTACATTGCTTCCCCGTAACGTGAGCGACTCTGTGTCGATAGTCTCGCTTGGGTCATTTTGTTTGGACGCCGGCTCGTGGTTGTGGTCGGTGCAGTACTCGGTAGACGTGGATTAG

Protein sequence:

>DPOGS209498-PA
MSSSPFPYTCELCGAEGLTDEGMRSHTLEAHVAGRPDCPFCDCTVPQPQLVGHVQRAHLHYLTPERELMAFIDDQSPSFEEDSKMTTTDSCSYNTPGSMNGWHSPEAASYHNGAISKNYYNGFQDKDNYREKDDDKYSRSPKNINLTNGMKSMNINNTAKKKCSRENSIDRDYINGHDKKAAHTNSNHNSNDSSPNKNKLTMASAGQGSPLRSQLALKLKSNTPKKNAPTPSPTVQCLLCDFKSTCPRKLEEHINRAHFDLTSPSVLGNANDNSNITNNATLSLSNATITLDNPTLALSAMSISPGPHSSSYQCPICEVEFSNGSEVEVHVNVEHRDILSPQKSDQADNALCDDVVMMEESPVSNCPVCCQPLPLSQHDCQLIDFHKPTAADGSHPALFDYVLRYFTHDPNAFKPPLYLQHQGHSRTIIGYEKHKDGKATLLVLDPSHSPAQVRQVSVGSWSSAASALRLLRRGAPALRARQYQLLCVDGLITTDQEYQVMYHLPPTCTTYHFHRQHIDGLITIDQGYQVMYHLPPTCTTYHFHRQHIDGLITIDQGYQVMYHLPPTCTTYHFHRQHIDGLITIDQGYQVMYHLPPTCTTYHFHRQHIDGLITIDQGYQVMYHLPPTCTTYHFHRQHIDGLITIDQGYQVMYHLPPTCTTYHFHRQHIDGLITIDQGYQVMYHLPPTCTTYHFHRQHIDGLITIDQGYQVMYHLPPTCTTYHFHRQHIDGLITIDQGYQVMYHLPPTCTTYHFHRQHIDGLITIDQGYQVMYHLPPTCTTYHFHRQHIDGLITIDQGYQVMYHLPPTCTTYHFHRQHIDGLITIDQGYQVMYHLPPTCTTYHFHRQHIDGLITIDQGYQVMYHLPPTCTTYHFHRQHIDGLITIDQGYQVMYHLPPTCTTYHFHRQHIDGLITIDQGYQVMYHLPPTLKRPMVIRLGLLVDLETTWKIVSHFTLLPRNVSDSVSIVSLGSFCLDAGSWLWSVQYSVDVD-