Monarch geneset OGS2.0

DPOGS207066
Transcript: DPOGS207066-TA, 1266 bp
Protein: DPOGS207066-PA, 421 aa
Genomic position: DPSCF300001 + 2290501-2297215
RNAseq coverage: 340x (Rank: top 34%)
Annotation
Heliconius: HMEL010202, 4e-122, 50.99%
Bombyx: BGIBMGA013113-TA, 2e-127, 48.32%
Drosophila: CG31344-PA, 4e-49, 30.23%
EBI UniRef50: UniRef50_B0WPZ3, 1e-50, 40.68%, Putative uncharacterized protein n=2 Tax=Culicinae RepID=B0WPZ3_CULQU
NCBI RefSeq: XP_001850777.1, 2e-51, 40.68%, conserved hypothetical protein [Culex quinquefasciatus]
NCBI nr blastp: gi|170046448, 3e-50, 40.68%, conserved hypothetical protein [Culex quinquefasciatus]
NCBI nr blastx: gi|170046448, 3e-49, 40.68%, conserved hypothetical protein [Culex quinquefasciatus]
Group
KEGG pathway 
Orthology group: MCL34691 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS207066-TA
ATGGACTTGACAGTGGATGAGTTTATAGAAGGTTTGTTTAGCGATGAAGCTCACGAAAGGCCATCTGATAATATCTCACCGGAGGATTTCGAGTTCAAACTGCCTGAATGGTTTGATGAGAAAAAATATAAGCAGGGGCAGAGATTTTATCGGGACTATTCATTCATGTTATCATTGTCTTTAATGCCTGGCTTAGTTAGTGTATTCGCTATTCCATCCATTTTGAAAGTCCTCTGCGGAAGCCGACGTTCCAACTCGAGGTTTACCGCTTACAGGCGATACATTTCAACATTCTCGCATATATTCACGTGGTATACTAAAGAGCTGAAGCCGGGATCCTTGTCGTGGAAATCCTTACAAACTGTAAGAATCAGACATTTTCGTGCTAGTCGTGCAGCAAAGATGAAAGGACAAGGTATTGTGTCACAGAGGGATATGGCTTTGACTTTGTTTGGGTTTATTGGCTTCGTCATATTGAAGCCAGATATATTTACTGTGACGCAATTAGAAGTAGGAGATTGGGAAGCTTTCAATCATATGTTAGGAGTAATCGCGCATATGATAGGTTTGGAAGATAGGTATAATATTTGCCGAGCGACAGTTCAAGAGACTAGGGAAGTCTGTAAGCAAATATTGGATCGCGTTTTCACACCTTGTTTGGACAATGTGCCAGAATATTTTGAACACATGTCGCGTGTAATGATCGAAGGTTTGAACGCTTCGCTCACTCCCTTAGAATCCAATTCATTGATATACAAGGCCAAGTATCTAGCCAATGTCCCTGGGTACATCACTACCGAGGAAGAGAGGATCGCTTTACAAGAGAGGATCAAAAAGTGTTTGAGAGGAAGATCACTTGATGAAGGTGTTGATTCCAGTATGTTAATAGAAAAGTCAGCTATAGATGGTCTTATGAAGAACACGAAGAGGATCCTGTATTACCACGATTACGATACTTTGGAGTCAGCGCCAACATACAAAAGCTTACCTTTTGTCGCTAAATGGAAGCTTACTTTACTTGACATTCTAAGGGAGATCCACGCTTTATATATAGGGCGCATTTTTCTTAACTTTTACATAAAGTGCTTATTATTCCTCGCTTTTTATTTCCCGTATATCGCTATATGGAGATACGGAATGCAGTATTACGTGGACATGTTCAAAGACAGCCCCGTCGATGATACAGAATTGATCCCAAATTCTGAATATAATAAGCCCCAGCCTCCAGAGCCCTGGTACAAAGTACTATACGGCATTTTCTGGTAG

Protein sequence:

>DPOGS207066-PA
MDLTVDEFIEGLFSDEAHERPSDNISPEDFEFKLPEWFDEKKYKQGQRFYRDYSFMLSLSLMPGLVSVFAIPSILKVLCGSRRSNSRFTAYRRYISTFSHIFTWYTKELKPGSLSWKSLQTVRIRHFRASRAAKMKGQGIVSQRDMALTLFGFIGFVILKPDIFTVTQLEVGDWEAFNHMLGVIAHMIGLEDRYNICRATVQETREVCKQILDRVFTPCLDNVPEYFEHMSRVMIEGLNASLTPLESNSLIYKAKYLANVPGYITTEEERIALQERIKKCLRGRSLDEGVDSSMLIEKSAIDGLMKNTKRILYYHDYDTLESAPTYKSLPFVAKWKLTLLDILREIHALYIGRIFLNFYIKCLLFLAFYFPYIAIWRYGMQYYVDMFKDSPVDDTELIPNSEYNKPQPPEPWYKVLYGIFW-
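
The lengths listed in this record can be cross-checked against the two sequences. The following is a minimal sketch in Python, not part of the original record. It assumes the standard genetic code, that the trailing "-" in the protein sequence marks the stop codon, and that the genomic coordinates are inclusive. The function names (translate, cds_length, genomic_span) are illustrative only.

```python
# Standard genetic code, built from the conventional T, C, A, G codon ordering.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO_ACIDS)
)


def translate(cds: str, stop_symbol: str = "-") -> str:
    """Translate a coding sequence in frame 0, writing stops as stop_symbol."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3  # ignore any trailing partial codon
    residues = (CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))
    return "".join(stop_symbol if r == "*" else r for r in residues)


def cds_length(protein_length: int) -> int:
    """Nucleotides needed to encode the protein plus one stop codon."""
    return (protein_length + 1) * 3


def genomic_span(start: int, end: int) -> int:
    """Length of an inclusive coordinate range."""
    return end - start + 1


if __name__ == "__main__":
    # First 21 and last 18 nucleotides of DPOGS207066-TA, copied from the record.
    print(translate("ATGGACTTGACAGTGGATGAG"))
    print(translate("TACGGCATTTTCTGGTAG"))
    print(cds_length(421))
    print(genomic_span(2290501, 2297215))
```

Under these assumptions, the 421 aa protein plus a stop codon accounts for the full 1266 bp transcript. The genomic interval is longer than the transcript, which is what one would expect if the gene contains introns, although the record itself does not list the exon structure.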