Monarch geneset OGS2.0

DPOGS204114
Transcript: DPOGS204114-TA, 3585 bp
Protein: DPOGS204114-PA, 1194 aa
Genomic position: DPSCF300184, 115857-125092
RNAseq coverage: 90x (Rank: top 63%)
Annotation
Heliconius: HMEL012907, E-value 0.0, 70.87%
Bombyx: BGIBMGA013603-TA, E-value 2e-152, 55.03%
Drosophila: CG6729-PA, E-value 9e-44, 27.21%
EBI UniRef50: UniRef50_E9I9E8, E-value 2e-115, 37.98%, Putative uncharacterized protein (Fragment) n=2 Tax=Arthropoda RepID=E9I9E8_SOLIN
NCBI RefSeq: XP_001122102.1, E-value 4e-112, 37.75%, PREDICTED: similar to CG6729-PA [Apis mellifera]
NCBI nr blastp: gi|322802435, E-value 9e-115, 37.98%, hypothetical protein SINV_02437 [Solenopsis invicta]
NCBI nr blastx: gi|322802435, E-value 8e-114, 38.29%, hypothetical protein SINV_02437 [Solenopsis invicta]
Group
KEGG pathway:
InterPro domain: [267-549] IPR019354, E-value 1.4e-53, Smg8/Smg9
Orthology group: MCL12986, Single-copy universal gene

Nucleotide sequence:

>DPOGS204114-TA
ATGACTACGGTATTCTCAATTAAAGATATTCCAGAATTTTCACAAAAGGAAAAGATTGTAGTAGTAGGCATTATTGGAAAATCGCCGTTTCGATATCCAAATAAGACATCACCACTGCTTTCTTCAGTACAATTTAAAGAAAATGACATAGAATGCCATTGGGATGAAAGACGGAGTATTTTATATCTTCATGCAGTCACTTACTTCGACACTGCGAGATTGGCAGCATTGGCAGCAGATTTGAATGAGGAATCTCAGAGCACGGTCAAAGATGCTGATGCTGCTCACTGGCTTGTGGCTTCTGGAGAGTTGGCGATAGATGCATGCAGGGCTATGGCATTGATATTTCATTTGTGCCATATTGTGATTCTTTCATTACCAACCCCTGTATTTGATCTTGGATATTTACAGCTTTTTAAAGCTATTGATGCTTACAGAACAGAGGTATCAGCACAAATTACAAGCGCTTTACACCAATCATGGGCAGCTGCTTGGCAGCATGGTCGGTTATGCTGTCCTCGGCTGCTATTTCACTTCAAACGAGCCCCCACTCCACTTAGAGCTGATCCCGCTGGCCTGAAGAGATTGGAACATGCTGTGGAGGACCAGATATACTTCATACTCCGGAAGTCCCGACTTATAACTAATGTCTGTGCCAAATCTCTATTTGCTATACCTAAGAATGAGGAGTTTGTGTATATAAGCGCTGAGGAGGTGGGATCGGCTCGCGACACTAGTTCGTTACTCCGTGGCTTGGTGCAGATGTGTTCCGGTTCAGGCGCCCCAACATCATCAGACCGACCCAGCTTCAGACATTTCCTTCAAGGTCACCTGAATTTAGCTTTCGGCGAAGGCTTTGACGATAATGTTGGCAAGTACGCCATGAGCACGTCATATTTTGAGCTACCCTCAGCATCATGTTGGCGGGGCGCGGCACGTGTCTTACACCGTCAATACTCCCAGCAGTCGACCGTCCTCTTCGACGCCCTGGCCACAGACGTGAGGTTCTCTCAGGCCAGATGCGCGAAGGTGTTACCCATAGCACAGGCCTCCTACGCGGAGGGACTGCCTCCTCACTACTCCAGCCAGCACCACGCGCATAAGCTGGTGGTAGCGCTCGGTGTAGTTGAGGCGATGGCCCGAGGTCCCCTCGCTGGCGGTGCCCGAACTCGACTTCGGGCCGCCTGCGACACCATCAGACGAGCTAGGGAACTGTGCGAGGAACCCAGCCTCACACACCACCCTTGTATACATCCCAAACACGACAGTTCCAAAGAGCATTCATCGGGAGTGCGTTATGTGAGTGCGTGCAACTGCGGCCGCACGCGTGTGTCACGCGACGACCCGTACTGTGCGCGATCAGCCAACTTCACCTTCTACACCCTGGCCGCTGAGCAGTGCAATTGTGACACCTTACAAGCGATCAGCTTCCCCGTCTTTCAACCTTCAACCCCTACATTCAGGGCAGCGTCAGTTAAAGGTGCTACTGTTAAAACTCAAGAGTTGTCCGAAGTACGACAGCAGTTAGAGGTCACGGAGGCCGAGTCACCCCAAGGGGAAGGCTCGCAACGAGGCTCCTGGTCACCCCCAGACGTGCTGTCACCTGGATCAGCGAGGGACGAAGATGAAGAGGATCCTGAGGATTCCAGCGATGAAGGAATAGAGGTCGTCACAGTACAGGATAACGGCACTACAGGCACAACAGAGGTATCAGCACAAATTACAAGCGCTTTACACCAATCATGGGCAGCTGCTTGGCAGCATGGTCGGTTATGCTGTCCTCGGCTGCTATTTCACTTCAAACGAGCCCCCACTCCACTTAGAGCTGATCCCGCTGGCCTGAAGAGATTGGAACATGCTGTGGAGGACCAGATATACTTCATACTCCGGAAGTCCCGACTTATAACTAATGTCTGTGCCAAATCTCTATTTGCTATACCTAAGAATGAGGAGTTTGTGTATATAAGCGCTGAGGAGGTGGGATCGGCTCGCGACACTAG
TTCGTTACTCCGTGGCTTAGTGCAGATGTGTTCCGGTTCAGGCGCCCCGACCTCATCAGACCGACCCAGCTTCAGACATTTCCTTCAAGGTCACCTGAATTTAGCTTTCGGCGAAGGCTTTGACGATAATGTTGGCAAGTACGCCATGAGCACGTCATATTTTGAGCTACCCTCAGCATCATGTTGGCGGGGCGCGGCACGTGTCTTACACCGTCAATACTCCCAGCAGTCGACCGTCCTCTTCGACGCCCTGGCCACAGACGTGAGGTTCTCTCAGGCCAGATGCGCGAAGGTGTTACCCATAGCACAGGCCTCCTACGCGGAGGGACTGCCTCCTCACTACTCCAGCCAGCACCACGCGCATAAGCTGGTGGTAGCGCTCGGTGTAGTTGAGGCGATGGCCCGAGGTCCCCTCGCTGGCGGTGCCCGAACTCGACTTCGGGCCGCCTGCGACACCATCAGACGAGCTAGGGAACTGTGCGAGGAACCCAGCCTCACACACCACCCTTGTATACATCCCAAACACGACAGTTCCAAAGAGCATTCATCGGGAGTGCGTTATGTGAGTGCGTGCAACTGCGGCCGCACGCGTGTGTCACGCGACGACCCGTACTGTGCGCGATCAGCCAACTTCACCTTCTACACCCTGGCCGCTGAGCAGTGCAATTGTGACACCTTACAAGCGATCAGCTTCCCCGTCTTTCAACCTTCAACCCCTACATTCAGGGCAGCGTCAGTTAAAGGTGCTACTGTTAAAACTCAAGAGTTGTCCGAAGTACGACAGCAGTTAGAGGTCACGGAGGCCGAGTCACCCCAAGGGGAAGGCTCGCAACGAGGCTCCTGGTCACCCCCAGACGTGCTGTCACCTGGATCAGCGAGGGACGAAGATGAAGAGGATCCTGAGGATTCCAGCGATGAAGGAATAGAGGTCGTCACAGTACAGGATAACGGCACTACAGGCACACTTTTTAAATTTCATTCATCAGGACTACCGGAGCATCTTCAGCCTGGTTTCCTGCCTCACACCAACTACCTCTTGCCGTGGGATTGTGCCGTGCGCCTGGAGCGTCCCGAGCGAGGGGACCTGGGGGTATGGAGGGGCAGGGGCAGGGGGAAGGCCCCCTCGCAGAGCCTTACAGTCAAGATATTCATAGGATACGAATATGAGTGTCCAAGAGGGCACAGGTTCATGATGTCGTCACCCGACACGGTGGTGTCAGGGGGCTCAGGGTGGTCCCGGGAGGCGGGCGAGGCCGGCGCCGGAGCCCTGCTCGCCTCCAGCTCCATGCCAGTGTTGTCCTCGTGTTTATGTCGAGCGGCCCAGCCCGCACACCTCGCACGACTGCACGTCGTCACACCGAAGGCTTCTATACACGTCACGCTCGACCCTAAGGTGCAGCCAGTCCCAGGCGGTCCAGTGTTCATTCCTCAGCCGGTCGGATCGCCGCCCATCAAGCTGAGCTCCTCAGCCTATTGGGTGCTTCGGTTCCCGTACGTGTATGCCCACGAGCACGGCGCTCTGCCCCGAGCGAGACTCCCGCCCGCCGGAAATGTTCTTCAACCAATGTTCGGACTACAGGAATAA

Protein sequence:

>DPOGS204114-PA
MTTVFSIKDIPEFSQKEKIVVVGIIGKSPFRYPNKTSPLLSSVQFKENDIECHWDERRSILYLHAVTYFDTARLAALAADLNEESQSTVKDADAAHWLVASGELAIDACRAMALIFHLCHIVILSLPTPVFDLGYLQLFKAIDAYRTEVSAQITSALHQSWAAAWQHGRLCCPRLLFHFKRAPTPLRADPAGLKRLEHAVEDQIYFILRKSRLITNVCAKSLFAIPKNEEFVYISAEEVGSARDTSSLLRGLVQMCSGSGAPTSSDRPSFRHFLQGHLNLAFGEGFDDNVGKYAMSTSYFELPSASCWRGAARVLHRQYSQQSTVLFDALATDVRFSQARCAKVLPIAQASYAEGLPPHYSSQHHAHKLVVALGVVEAMARGPLAGGARTRLRAACDTIRRARELCEEPSLTHHPCIHPKHDSSKEHSSGVRYVSACNCGRTRVSRDDPYCARSANFTFYTLAAEQCNCDTLQAISFPVFQPSTPTFRAASVKGATVKTQELSEVRQQLEVTEAESPQGEGSQRGSWSPPDVLSPGSARDEDEEDPEDSSDEGIEVVTVQDNGTTGTTEVSAQITSALHQSWAAAWQHGRLCCPRLLFHFKRAPTPLRADPAGLKRLEHAVEDQIYFILRKSRLITNVCAKSLFAIPKNEEFVYISAEEVGSARDTSSLLRGLVQMCSGSGAPTSSDRPSFRHFLQGHLNLAFGEGFDDNVGKYAMSTSYFELPSASCWRGAARVLHRQYSQQSTVLFDALATDVRFSQARCAKVLPIAQASYAEGLPPHYSSQHHAHKLVVALGVVEAMARGPLAGGARTRLRAACDTIRRARELCEEPSLTHHPCIHPKHDSSKEHSSGVRYVSACNCGRTRVSRDDPYCARSANFTFYTLAAEQCNCDTLQAISFPVFQPSTPTFRAASVKGATVKTQELSEVRQQLEVTEAESPQGEGSQRGSWSPPDVLSPGSARDEDEEDPEDSSDEGIEVVTVQDNGTTGTLFKFHSSGLPEHLQPGFLPHTNYLLPWDCAVRLERPERGDLGVWRGRGRGKAPSQSLTVKIFIGYEYECPRGHRFMMSSPDTVVSGGSGWSREAGEAGAGALLASSSMPVLSSCLCRAAQPAHLARLHVVTPKASIHVTLDPKVQPVPGGPVFIPQPVGSPPIKLSSSAYWVLRFPYVYAHEHGALPRARLPPAGNVLQPMFGLQE-
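
Length check:

The annotated lengths agree with each other if the transcript is a complete coding sequence with no UTRs (an assumption, though the sequence does start with ATG and end with TAA): 1194 codons plus one stop codon give 3 x 1194 + 3 = 3585 bp. The sketch below shows one way such a check might be run on FASTA records like the two above. The function names `read_fasta`, `expected_cds_length`, and `lengths_consistent` are hypothetical helpers written for this illustration; they are not part of any database tooling.

```python
def read_fasta(text):
    """Parse FASTA text into {header: sequence}, joining wrapped lines."""
    records = {}
    name = None
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue  # skip blank lines between records
        if line.startswith(">"):
            name = line[1:]
            records[name] = []
        elif name is not None:
            records[name].append(line)
    return {header: "".join(parts) for header, parts in records.items()}


def expected_cds_length(protein_length):
    """Coding-sequence length in bp: three bases per residue plus one stop codon."""
    return 3 * protein_length + 3


def lengths_consistent(cds, protein):
    """True if the CDS length matches the protein length.

    Assumes a trailing '-' or '*' on the protein marks the stop codon
    and is not a residue.
    """
    residues = protein.rstrip("-*")
    return len(cds) == expected_cds_length(len(residues))


if __name__ == "__main__":
    # Toy records, not the monarch sequences.
    toy = read_fasta(">toy-TA\nATGAAA\nTAA\n\n>toy-PA\nMK-\n")
    print(lengths_consistent(toy["toy-TA"], toy["toy-PA"]))
    print(expected_cds_length(1194))
```

Because `read_fasta` joins wrapped lines, a sequence split across several lines is handled the same way as one stored on a single line.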