Monarch geneset OGS2.0

DPOGS207067
TranscriptDPOGS207067-TA1293 bp
ProteinDPOGS207067-PA430 aa
Genomic positionDPSCF300001 + 2298219-2307831
RNAseq coverage872x (Rank: top 15%)
Annotation
HeliconiusHMEL0102056e-14565.37% 
BombyxBGIBMGA013113-TA8e-15456.76% 
DrosophilaCG31344-PA3e-5230.55% 
EBI UniRef50UniRef50_B0WPZ34e-6335.48%Putative uncharacterized protein n=2 Tax=Culicinae RepID=B0WPZ3_CULQU
NCBI RefSeqXP_001850777.17e-6435.48%conserved hypothetical protein [Culex quinquefasciatus]
NCBI nr blastpgi|1700464481e-6235.48%conserved hypothetical protein [Culex quinquefasciatus]
NCBI nr blastxgi|1700464484e-6234.70%conserved hypothetical protein [Culex quinquefasciatus]
Group
KEGG pathway 
Orthology groupMCL12567 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS207067-TA
ATGTCTACTGAAGTTGATGTCTATAGACTAACGTCTTCACTTGTGGATGAATACATAGAGGGCTTATTTTCTAAAGATGCCTTCGAACAACCATCTGATTCCAAAAAGCCTGAAGATTTGGAGATGAAATTACCAGAATGGTTCGATGAGAAACAGTTCAATAAAGCCAGGAGATTCTACTGGAACAACTGTTATATGCTGTCGTCGTCGATGCTGAACGGCCTCGTCGCTGTGTTCGCTATACCATCCATATTACAAGTTTTAATTGGCACAAAACGTTCAAACTCTCCGTACACGGCGTACAAACGTTACCTCTCAACGTTACTTCACACGCTTAGCTGGTTCGAACACGAATTGAAACCTGGTAGTAGATCTTGGAGATCCCTTTTCACTGTGCGTACTCGTCACTTCAAAGCAAGTTTAGCTGCGAAACTCAAAAATCAACCGTTAGTCTCCCAAAGGGACCTTGCACTCACACAATTCGGTTTCATTGGTTTCTCCATGATCAAATCGGACAAATTCGGCATCCGTCAATTTGAAACTGGCGATTGGGAAGCATACAACCACTTTTGGCGGACCATCGGACATCTTATCGGATTGGAGGACAAGTATAATCTGTGCCGCAAGAACGTCGATGAAACCCGTCAGGCCTGTCAGATACTGCTGGATCGTGTGTACACCCCCTGTCTAGAGAACGTTCCAGAATACTTCGAACATGTATCACGCGCTATGCTGGAAGGCCTGTGGTGTGTCAATCCCACGGTACATATCGATGGGATGCTATACTTTACGAAGTACTTGTGCTCGGTCCCAGGGTATGTTTATACGGAACAAGACAGGATCGAGCTGCAGCAGAAGTGTATGAAGCAGTTAAATGGACGGTCGGACGAAATTGGTATCGAAACATCATCACTAATGGCAAAGCCTTTAATTGATCTGCCTCCAAGGAAACATCTTATTTACATTAGAGATTACGACAGCTTGGAAACGGTGCCGCCGTATAAACGGCTACCACTAGCTGGCAAATATAAAACGGCGCTTTACTATATACTCTCCGCCTTTTACACGTCATATATAGGCAGGATATATCTGAACTTAAATTTCAGATTTAGCTTGTTCTTAATGAAGTATTTCCCGTACATGGCTTTCTTTAGATTTGGAATTATTAAATCATATGTTAATATATTTAAAGAGGACCCCATGGACAACGCTGAACTGAAGCCGAACTCAGAGTATGAGAAACCTCAACCTCCTTTGCCACTTTACAAAGAACTATTGTCTTTGATCTGGTGA

Protein sequence:

>DPOGS207067-PA
MSTEVDVYRLTSSLVDEYIEGLFSKDAFEQPSDSKKPEDLEMKLPEWFDEKQFNKARRFYWNNCYMLSSSMLNGLVAVFAIPSILQVLIGTKRSNSPYTAYKRYLSTLLHTLSWFEHELKPGSRSWRSLFTVRTRHFKASLAAKLKNQPLVSQRDLALTQFGFIGFSMIKSDKFGIRQFETGDWEAYNHFWRTIGHLIGLEDKYNLCRKNVDETRQACQILLDRVYTPCLENVPEYFEHVSRAMLEGLWCVNPTVHIDGMLYFTKYLCSVPGYVYTEQDRIELQQKCMKQLNGRSDEIGIETSSLMAKPLIDLPPRKHLIYIRDYDSLETVPPYKRLPLAGKYKTALYYILSAFYTSYIGRIYLNLNFRFSLFLMKYFPYMAFFRFGIIKSYVNIFKEDPMDNAELKPNSEYEKPQPPLPLYKELLSLIW-