Monarch geneset OGS2.0

DPOGS209171
TranscriptDPOGS209171-TA1263 bp
ProteinDPOGS209171-PA420 aa
Genomic positionDPSCF300061 - 74671-76388
RNAseq coverage183x (Rank: top 49%)
Annotation
HeliconiusHMEL0097450.077.35% 
BombyxBGIBMGA011483-TA1e-14674.05% 
DrosophilaNotum-PB5e-8845.71% 
EBI UniRef50UniRef50_E9H1P44e-9840.65%Putative uncharacterized protein (Fragment) n=1 Tax=Daphnia pulex RepID=E9H1P4_DAPPU
NCBI RefSeqXP_002085045.16e-9147.80%GD14589 [Drosophila simulans]
NCBI nr blastpgi|2608087773e-9443.88%hypothetical protein BRAFLDRAFT_275198 [Branchiostoma floridae]
NCBI nr blastxgi|2608087772e-9343.37%hypothetical protein BRAFLDRAFT_275198 [Branchiostoma floridae]
Group
KEGG pathway 
Orthology groupMCL12896 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS209171-TA
ATGTTAATTTTTCAGGCGGTGGTCATATCAGTCTGCGAAAGTCTGGTCCAGGCAGACAGTCTGCGACTGGTGTGGCTCACAAACACTTCACTGACCTGTAATGATGGATCACCCGCAGGATATTACATCCGTCGTGGCAGTAACAGCCGTCACTGGGTGTTGTATTTGGAAGGTGGTGGCTATTGTTGGGACGCGGGCTCATGTGGCGCGCGGTGGACGAGACGCCCTGGCCTAATGTCTTCCACACGTTGGCCTCGAGCGCGAAGAGCTCCAGCCTTGCTATCTTCCGACCCCCAAGCAAACCCTCTCTGGCACGCCTCCAATCATGTTTTATTACCGTATTGCTCTAGCGATATGTGGGCAGGAACTCGTCTCCATACAAGAACTAATGGCAGTTTCGCGTTTGTGGGGCACCTTATTGTCCGATCGGTCCTCAATGAACTATTGCACCTAGGCCTCGCGGGCCGTTTGCTACTTGTAGGATCTAGTGCTGGAGGTACGGGTGTCATGCTTCACGCTGACTCTACAAGAAGAACTCTTAGAGCTCACAGTGTACGAGTTGCGGCTATAGCAGATTCTGGATGGTTCTTGGATCGTCCACCAAGAGCGAGACGTGCATCATCAGCTAACGCTGTAGCTCGTTTAGGCCACACATTATGGTTAGGGGCACCACCCAATTCCTGCGTTAGGGATTTCCACGACAAGCCCTGGCTATGCTATTTTGGGTATCGGCTCTACCCTCACATACGCACGCCCCTTTTTGTTTTCCAATATCTTTTTGACTCTGCCCAGCTTACAGCAGAAGGAGTACGCGCTCCTAGGACGAGAGCGCAATGGGACGCCGTTCATGAGACGGGCGCGGCTATTCGGGCTAGCTTGAAGACCGTACGCGCTACCTTCGCGCCTGCATGTATAGCCCACGGCGCCCTCGCACGCCCGGAGTGGCTGGCAATAAATGTGTCGGGCATATCATTGCCAAACGCGATCGCCTGCTGGGAACGCCGGTTCAGAGACGGTAATAGGAAGGAACGCCCTAGATGTGCACCTCGGAGACTGATTGAGCGTTGTTCTTGGCCGCAATGTAACAGTTCGTGTCCTAGACTGCGAGATCCTCGGACTGGTGAGGAAGTCGCTCTGGCGGCTTTGCTACAAAGTTTCGGTCTAGACGTCCGTGGTGCTGCAGCCGCGATGGGTCTTGATGCTCGAGCTTTGTCTCGTATGAGTCGAGCCGAGCTACTGCCACTCTTGGCACCCCACACGTGA

Protein sequence:

>DPOGS209171-PA
MLIFQAVVISVCESLVQADSLRLVWLTNTSLTCNDGSPAGYYIRRGSNSRHWVLYLEGGGYCWDAGSCGARWTRRPGLMSSTRWPRARRAPALLSSDPQANPLWHASNHVLLPYCSSDMWAGTRLHTRTNGSFAFVGHLIVRSVLNELLHLGLAGRLLLVGSSAGGTGVMLHADSTRRTLRAHSVRVAAIADSGWFLDRPPRARRASSANAVARLGHTLWLGAPPNSCVRDFHDKPWLCYFGYRLYPHIRTPLFVFQYLFDSAQLTAEGVRAPRTRAQWDAVHETGAAIRASLKTVRATFAPACIAHGALARPEWLAINVSGISLPNAIACWERRFRDGNRKERPRCAPRRLIERCSWPQCNSSCPRLRDPRTGEEVALAALLQSFGLDVRGAAAAMGLDARALSRMSRAELLPLLAPHT-