Monarch geneset OGS2.0

DPOGS210600
Transcript: DPOGS210600-TA, 1908 bp
Protein: DPOGS210600-PA, 635 aa
Genomic position: DPSCF300168 - 29343-32536
RNAseq coverage: 199x (Rank: top 47%)
Annotation
Heliconius: HMEL005897, 0.0, 85.71%
Bombyx: BGIBMGA014415-TA, 7e-52, 81.30%
Drosophila: CG5222-PA, 0.0, 51.76%
EBI UniRef50: UniRef50_Q95TS5, 0.0, 51.76%, CG5222 n=30 Tax=Neoptera RepID=Q95TS5_DROME
NCBI RefSeq: XP_002433242.1, 0.0, 52.42%, conserved hypothetical protein [Pediculus humanus corporis]
NCBI nr blastp: gi|242025660, 0.0, 52.42%, conserved hypothetical protein [Pediculus humanus corporis]
NCBI nr blastx: gi|350408420, 0.0, 53.07%, PREDICTED: integrator complex subunit 9-like isoform 2 [Bombus impatiens]
Group
KEGG pathway 
InterPro domain: [307-428] IPR022712, 1.1e-11, Beta-Casp domain
Orthology group: MCL14249, Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210600-TA
ATGAAACTGTATTGTTTAAGTAGCGACGCGGCCAAGCCATGCTTCGTGTTATCGTTCAAGGAGCTCTTGATCATGTTAGACTGTGGATTGTCAGCGCACTCCGTGTTGAACTTCCTTCCACTACCACCAGTACCAAGTACAAGACTTGCCTCACTGCCCAACTATACTCCACCACATGTCAACGACCCTCTACTAGAAGGGGAACTCAAGGAATGCTGCGGTCGTGTCTTTGTGGATAGTGTACCAGAATTCTGTCCACCCCTCGATAAGGTGGTAGACTTCTCCCAGCTGGACGTCATTCTAATATCCAACTACACATGTATGATGGCTCTGCCCTTCATAACGGAAGACACGGGTTTTAAAGGACAAGTTTACGCGACCGAGCCCACTCTCCAGATAGGCAGGTTCTACCTCGAGGAGCTCTCGGAGTGGGTGTCGGGGAGCGGCGGCGGCGCGGGCGCGGCCAAGAGATGGAAGGAGCTCGTACACTTGTTACCGCCGCCTCTGGCCTCTGCGCTCCGGCCGCGAGCCTGGCGTCGCCTGTTCTCGCCCGGGGCTCTAGCGCGCGCTCTGTCGAGGGTTCGGGTCGTGGGCTACGACGAGCGAGTCGACATCTACGGTGCGCTCGACGCCACAGCCGTCAGCTCGGGGTTCTGCCTCGGCTCCGCGAATTGGGTTCTGCGGTCGGCGCACGAAAAGGTGGCTTACGTGAGCGGCTCCAGCACCCTGACCACTCACCCGCGACCCATCAACCAGGCTGCGCTGCGAGGCGCCGATCTCCTGGTGCTGGCCGCCCTGACGCAGACTCCGGCGCACAACCCCGACCACATGTTGGGAGACCTGTGCGTGCACGCCACCGTGACACTGCGGGCGGGCGGCTCCGTGCTGTGTCCGGTGTACCCGAGCGGCGTGCTCTACGACTTGTTGGAGTGTCTCTCGGCTCACCTGGAAGGCGCGGGCCTAGCTCACGTGCCGCTGTACGTGGTCTCGCCCGTCGCCGACTCCTCCTTGGCTTATAGTAACATCCTCGCGGAGTGGGTATCGGTGGGTAAGCAGGCGCGCGTCTACCTCCCCGAGGAGCCATTTCCTCACGCGGCACTCGTCCGCGCGGGCCGCCTCAAGCACGCCCGCTCCCTACACGACGACGCCTTCAGCGCGGACTTCCGTCAGCCCTGCGTCGTATTCTGCGGTCATCCGAGTCTGCGGTTCGGAGCGGCCGTCCACCTCGTTGAGCTCTGGGCGAACAATCCCGCTCACGCCATAATATTTACCGAGCCGGACTTCCCTCACGCTGAGGCGCTCGCCCCCTTCCAGCCACTGAGCATGAAGGCCTTCCACTGTCCGATAGACACGTCCCTCAACTACTCACAGGCCAACAAGCTGGTCCGCGAGCTGCGGCCGCGCGAGTTGGCCCTGCCCGAGCAGTATGCGGCGTCCGGCGGGACGGCGGCGGGCGGCGGGGCGGCGGCGAGCGGCGGGGCAGGCGGAACGAGACCTCACATCGGCGCTGACGTGCCGACTGTGGTGGTCCGGCGCGGAGCCGCGCGGTCTCTGGGCCTCCGGGCCGGTCTGCGCGCAGCGCCCCTGACAGCCGCCTTGCGCGTGCGTGACGCGCGCCTCGAGCTTGTAGCACCGGCGGCGTGCGGAACTCCCGGCACGGAGGCAGCCCCGGCGCCCGTCCTACACTGGAGCGCCCTGGACGTGGAAGCGCTGGTGCGGGCGCTGGCAAGGGAGGGCGTGTCGGAGGCGCGGGTAGAGGCGGGCGCGGACGGCTGTATAGTGCATCTCCCGCGACACGACACGCTGGTCCACGTCGAGCGACACGCCACTCACGTGTTCTGCGAGGGTCGCTCGGACGTGCGTCAGGCGCTGAGACGGGCGCTGGCCGCGTGTCTGCCACACATCTAA

Protein sequence:

>DPOGS210600-PA
MKLYCLSSDAAKPCFVLSFKELLIMLDCGLSAHSVLNFLPLPPVPSTRLASLPNYTPPHVNDPLLEGELKECCGRVFVDSVPEFCPPLDKVVDFSQLDVILISNYTCMMALPFITEDTGFKGQVYATEPTLQIGRFYLEELSEWVSGSGGGAGAAKRWKELVHLLPPPLASALRPRAWRRLFSPGALARALSRVRVVGYDERVDIYGALDATAVSSGFCLGSANWVLRSAHEKVAYVSGSSTLTTHPRPINQAALRGADLLVLAALTQTPAHNPDHMLGDLCVHATVTLRAGGSVLCPVYPSGVLYDLLECLSAHLEGAGLAHVPLYVVSPVADSSLAYSNILAEWVSVGKQARVYLPEEPFPHAALVRAGRLKHARSLHDDAFSADFRQPCVVFCGHPSLRFGAAVHLVELWANNPAHAIIFTEPDFPHAEALAPFQPLSMKAFHCPIDTSLNYSQANKLVRELRPRELALPEQYAASGGTAAGGGAAASGGAGGTRPHIGADVPTVVVRRGAARSLGLRAGLRAAPLTAALRVRDARLELVAPAACGTPGTEAAPAPVLHWSALDVEALVRALAREGVSEARVEAGADGCIVHLPRHDTLVHVERHATHVFCEGRSDVRQALRRALAACLPHI-
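
Consistency check (illustrative sketch, not part of the original record): the reported lengths, 1908 bp and 635 aa, agree if the transcript is read as a complete coding sequence whose final codon is a stop. The Python below checks that arithmetic and translates the first five codons and last three codons of the transcript shown above. It assumes that the record contains no UTRs and that the trailing "-" in the protein sequence marks the stop codon. The names expected_protein_length, translate, and CODONS are hypothetical helpers introduced only for this example, and the codon table is deliberately limited to the codons these two excerpts need.

```python
# Lengths as reported in the record above.
TRANSCRIPT_BP = 1908
PROTEIN_AA = 635


def expected_protein_length(cds_bp: int) -> int:
    """Residues encoded by a complete CDS, excluding the stop codon.

    Assumption: the sequence is coding from first to last base.
    """
    if cds_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_bp // 3 - 1


# Partial standard genetic code: only the codons used in the excerpts below.
# "-" stands for a stop codon, matching the record's protein notation (assumed).
CODONS = {
    "ATG": "M", "AAA": "K", "CTG": "L", "TAT": "Y", "TGT": "C",
    "CAC": "H", "ATC": "I", "TAA": "-",
}


def translate(nt: str) -> str:
    """Translate an in-frame nucleotide excerpt codon by codon."""
    return "".join(CODONS[nt[i:i + 3]] for i in range(0, len(nt), 3))


if __name__ == "__main__":
    # 1908 / 3 = 636 codons; minus the stop codon gives 635 residues.
    print(expected_protein_length(TRANSCRIPT_BP) == PROTEIN_AA)  # True
    print(translate("ATGAAACTGTATTGT"))  # first 15 nt -> MKLYC
    print(translate("CACATCTAA"))        # last 9 nt  -> HI-
```

The two translated excerpts match the start (MKLYC) and end (HI-) of DPOGS210600-PA. A full translation would need the complete 64-codon table, for example from an established library, rather than this partial dictionary.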