Monarch geneset OGS2.0

DPOGS206872
TranscriptDPOGS206872-TA1392 bp
ProteinDPOGS206872-PA463 aa
Genomic positionDPSCF300001 - 2308991-2312119
RNAseq coverage147x (Rank: top 54%)
Annotation
HeliconiusHMEL0102060.074.63% 
BombyxBGIBMGA013149-TA1e-17166.46% 
DrosophilaEs2-PA1e-6939.96% 
EBI UniRef50UniRef50_Q7PXD28e-8842.74%AGAP001347-PA n=5 Tax=Endopterygota RepID=Q7PXD2_ANOGA
NCBI RefSeqXP_321797.41e-8842.74%AGAP001347-PA [Anopheles gambiae str. PEST]
NCBI nr blastpgi|1583021813e-8742.74%AGAP001347-PA [Anopheles gambiae str. PEST]
NCBI nr blastxgi|1583021813e-8742.22%AGAP001347-PA [Anopheles gambiae str. PEST]
Group
KEGG pathway 
InterPro domain[25-377] IPR0191486.7e-72Nuclear protein DGCR14
Orthology groupMCL14168 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS206872-TA
ATGAAAATACTTAAAAATGAAAGTTCCCTTTTTAAGGTACCTAAAGTACCACCAATTAAGAGAAAAGTCACGAAAACACATATCTTGGATGAAGAAGATTATGTACAGGGAATAGCACAGATCATACAGAGAGATTTCTTCCCAGATTTAGAAAAACTTAAGGCGCAAAATGATTACTTAGAAGCTTCTGAAAACAAAGACTATGCCAGGCTACGACAAATTGCCAAAAAATACAGTGGGCATAGACCACCTACAGAACCATATAATTCTCCTGCCACATTTGACACACCAAATGCCGATAGACCTTTTTCTCCATCCTCAGCAGAAGCACAATCAATACCTAAAGAAACTGTTGAGACCTTTAAGGATATAACTGATAAACATACTTTAGATTCCTTCCTCGCGCTACATACAAGTGAAGACAATGCAAGCTACAATCGCGTTATAGCACTTGAAAACAAGAAGAAAGCAAGTAAAATTGCTTCCCAGTTGCAGTCAGAGGTCACCTCAGCTATTCAAGCTGACAATGCATTAGCATTGCCATCTTTGGAGCAACAGGCTAATCAAGAAGAGAGACCTCATGAGCTGGACACATGGAGGTATCGGGCCAAGAACTACATAATGTATGTGCCGGACGGGGCTGAATCTCAGCTGCCTAGCCCCAAACCAGAGCTGCAACACCACAACACTAGACTTACAACACAAGTATTTGATTCAGCTAAGAACAAAGAAGCGATAACTGCTTTGGCAAGAAGTCAGGATTCAGCTATCCAAGGCAAGATTGGTGTTGATGGTGTCAGTATAGGCGAGAAGCCGGAGTACAACTTCGTGTCGACTCCATCACCTAGACCAGGAGCTGGTCCGGACCAGTCCCCGCTCATGACCTGGGGAGAGATTGAGGGCACTCCATTCAGGCTGGACGGAGGGGACACGCCTCTACCTGCTGTGGGCGCAGGCATGGCTTATCGTATGCTGGAGTCGGGCTCCCGCGAGAGGATCGCACTGCAGCTGGCGGAGAGAGCTGCGAAGCGGAGACGACCAACAACACCAACACATACCATGAAGACGCCTGGGAGTTTCAGAACCAACACAGAGAGGTTGGCAAGCATGTCGCCAGCGGCAAGGAAATTGGCGGCAAAGCATTTACTGTCGCCACGCTTGAAATTAACACCCAACGCCATGGGTATATCGCATAAAACACCAAAAATAACCCCGTCCCCAGGAACACCGTTGGTGGCAACTCCAAAAACACCATCTTCAGCTAAAACATCTGAGAATCCATCTCAAACTCCAGAGAGCAGTTCACAGACGGATAAAAATCTCACAGATAATCTATTACAGATAAACCTTGCAAAGAGGACAAGGCTAAAAGCACAAGATTTCTTCAAATAA

Protein sequence:

>DPOGS206872-PA
MKILKNESSLFKVPKVPPIKRKVTKTHILDEEDYVQGIAQIIQRDFFPDLEKLKAQNDYLEASENKDYARLRQIAKKYSGHRPPTEPYNSPATFDTPNADRPFSPSSAEAQSIPKETVETFKDITDKHTLDSFLALHTSEDNASYNRVIALENKKKASKIASQLQSEVTSAIQADNALALPSLEQQANQEERPHELDTWRYRAKNYIMYVPDGAESQLPSPKPELQHHNTRLTTQVFDSAKNKEAITALARSQDSAIQGKIGVDGVSIGEKPEYNFVSTPSPRPGAGPDQSPLMTWGEIEGTPFRLDGGDTPLPAVGAGMAYRMLESGSRERIALQLAERAAKRRRPTTPTHTMKTPGSFRTNTERLASMSPAARKLAAKHLLSPRLKLTPNAMGISHKTPKITPSPGTPLVATPKTPSSAKTSENPSQTPESSSQTDKNLTDNLLQINLAKRTRLKAQDFFK-