Monarch geneset OGS2.0

DPOGS213953
TranscriptDPOGS213953-TA1323 bp
ProteinDPOGS213953-PA440 aa
Genomic positionDPSCF300226 + 48673-50159
RNAseq coverage2502x (Rank: top 5%)
Annotation
HeliconiusHMEL0152801e-4545.30% 
BombyxBGIBMGA003370-TA3e-2853.42% 
DrosophilaCG31626-PA5e-0840.67% 
EBI UniRef50UniRef50_G0QZU54e-0842.53%Induced during granule regeneration 1, putative (Fragment) n=1 Tax=Ichthyophthirius multifiliis strain G5 RepID=G0QZU5_ICHMG
NCBI RefSeqNP_724321.11e-0640.67%CG31626, isoform B [Drosophila melanogaster]
NCBI nr blastpgi|3879975876e-1041.94%extracellular metalloproteinase, serralysin family [Pseudomonas fluorescens SS101]
NCBI nr blastxgi|1984629607e-3944.58%GA28529 [Drosophila pseudoobscura pseudoobscura]
Group
KEGG pathway 
Orthology groupMCL34998 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS213953-TA
ATGAACGCGGTGGTGTTGTTATCGTTGGTAGCGGCTGTAGTGGGCGAGGCACCTTATAACTTACCCCGGCCAGCACCTCATGCGCCAGCCATCAATTACCAACCTATTCCAGTACAGCCACAGCCGAATCCTCAACCACAGGCTCCTCAATACCCCTCTAATGGATACAGTTATCAAAGACCAAGCAGTCCTATTCTAATTCCTTCTGACCCAGTTCCACAGCCTCAATATCCTCAACCAATTAGACCAGTACCGCAACCTTTACCCCATTACCCGCAACCACAGCCTGTACCTCATTACCCGCAACCACAGCCTGTACCCCACTACCCGCAACCACAGCCTGTACCGAATTACCCTCAACCAATAAGACCAGCACCACAACCTGTGCCTCACTACCCGCAACCACAACCCGTACCTTACTACCCTCAACCACAACCCGTGCCTCACTACCCTCAACCAATTAGGCCAGCACCACAGCCGGTGCCCCACTATCCTCAACCACAACCTGCCCCTTACTTCCCACAACCTCAACCGCAACCTCTACCTCACAATCCTCAACCAATAGCACCACAGCCGCAACCTATTTCACCACCGGGCACATCTTACGGAGTGCCAGCCCCCATACAGCCATACCAGCCTCTTCCCTCAGTCATTCCTCAGCCTCAACCAGCTCCTATCCTATACGAGCAACAACAAGTACAATACCAACAGCAGCAGCAACAACAACAAGTTCAATACCAACAGCATCAGCAACAACAGCAAGTACAATACCAACAGCAACAGGTGGTGGAGCAGCAAGTACAACAACAACAACAGCCACAACAACTCTTACCTCAAGGATACCTTTACCCACAAGGACAACCTGCCCAACAAGTGTCAGTAAAATCTCAAGAGTCGAGTTCCGCAGTGTCAAGCTACCAGGCCGACTCCATTCAGTACAGCAACGCCCAACAAGATCAGGTTCACAGAGAAGCTTTAGACAACGTCGTTGTTGGTCGAGTACAAGATGTTATAAAGGACAGCGAACACTCATCAGCTAAGGAGTCAGGATATGTATCCCTCGTTTCTGGAGTGTCTTTGGGAGAAGCTAAGCCAAGCAGAGAAATCTTATCTTTTGTGCACAGTTCGCCAGTGTCTGGCCAATCTAGTCAAAGCTCCCAATCATCTTCATCTTCATCATCTTCATCATCATCATCATCATCATCATCATCATCAGTAGAACAACAAAATGATTCTGCGGATTACAGTCTGTCGTCTGATGAATTAAGGGGGGCATTCTTGCCGGAAAACAAACCGGCGTCGTCATACGGTTTACCAAATTAA

Protein sequence:

>DPOGS213953-PA
MNAVVLLSLVAAVVGEAPYNLPRPAPHAPAINYQPIPVQPQPNPQPQAPQYPSNGYSYQRPSSPILIPSDPVPQPQYPQPIRPVPQPLPHYPQPQPVPHYPQPQPVPHYPQPQPVPNYPQPIRPAPQPVPHYPQPQPVPYYPQPQPVPHYPQPIRPAPQPVPHYPQPQPAPYFPQPQPQPLPHNPQPIAPQPQPISPPGTSYGVPAPIQPYQPLPSVIPQPQPAPILYEQQQVQYQQQQQQQQVQYQQHQQQQQVQYQQQQVVEQQVQQQQQPQQLLPQGYLYPQGQPAQQVSVKSQESSSAVSSYQADSIQYSNAQQDQVHREALDNVVVGRVQDVIKDSEHSSAKESGYVSLVSGVSLGEAKPSREILSFVHSSPVSGQSSQSSQSSSSSSSSSSSSSSSSSSVEQQNDSADYSLSSDELRGAFLPENKPASSYGLPN-