Monarch geneset OGS2.0

DPOGS209626
TranscriptDPOGS209626-TA1155 bp
ProteinDPOGS209626-PA384 aa
Genomic positionDPSCF300015 + 827677-832114
RNAseq coverage120x (Rank: top 58%)
Annotation
HeliconiusHMEL0170222e-13975.00% 
BombyxBGIBMGA011141-TA1e-16476.92% 
DrosophilaCG42749-PC5e-13259.84% 
EBI UniRef50UniRef50_Q8MS323e-10465.00%RE24790p n=46 Tax=Pancrustacea RepID=Q8MS32_DROME
NCBI RefSeqXP_001602613.17e-13763.10%PREDICTED: similar to ENSANGP00000012390 [Nasonia vitripennis]
NCBI nr blastpgi|1565451401e-13563.10%PREDICTED: hypothetical protein LOC100118714 isoform 1 [Nasonia vitripennis]
NCBI nr blastxgi|1565451402e-14363.10%PREDICTED: hypothetical protein LOC100118714 isoform 1 [Nasonia vitripennis]
Group
Gene OntologyGO:00190284.4e-21viral capsid
KEGG pathway 
InterPro domain[32-248] IPR0043024.4e-21Chitin-binding, domain 3
Orthology groupMCL15438 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS209626-TA
ATGGAGATAAGAAGGAGGAACGGCACGTCGTTTTTAACATACTGGATATTTTTACTACAAATGTTAGGTAGTGGTCGTCGTGTCGCCGAAGGTCACGGTCGCCTCATGGATCCACCAGCGAGGAATTCCATGTGGAGATTCGGTTTCCCCAATCCCGTTAATTACAATGATAACGAGTTATTCTGCGGCGGTTATGCCGTCCAGTGGGAACAGAACGGAGGAAAGTGCGGGGTGTGCGGTGACGCGGAGCACCTCAGCGAACCTCGTCCTCATGAGGCTGGCGGCATGTATGGCAAAGGAATCATAACACGTCATTATAGTGTGGGGCAGGAAATTGAGGTAGAAGTGGAGCTGACAGCAAACCACCTCGGCGCGTTTGTGTTGAAGCTGTGTCCCAACAATAATCCTAATCAGGAAGCCACGCAAGAATGCTTTGATAGGTACCCACTTTTCATTTCCGGTACAAGAGAGGACAGGTTCTTGATTCCTCTTGACACAGCGAAGAAGGACACTTTCAGATACAGAGTTCGATTACCGCCCTACGTCACTTGCACACAGTGTGTCCTGCAATGGACATATTATACTGGCAACGTTGCTTGCTACAACAAGCTGGTTAAAATAAAAATGCAACGCAATGTGGATCATGTGAAGTGCATTACAGGCAACATGTGGGGTATCTGTCCGAACGGGACGGAGGCGGTCGGCTGTGGCCGCTCCGAAACCTTCCGTAACTGCGCCGACGTGGCCGTCATCACCAACACGGGTGGTCTGCCCCCGGCCTTCGCTGACGGCCTCCGACGGGACAACCCCTTCCTCCTCTACTACAGGGACTACAACATGCCGCAAAACGTCTTCCCTCTCGTCGTCAGCAACGATATTGATAGTGAAACAGAGGAGGATATAAGTCCCTTCGTCATTAGGGAACAGGTGTGTGTTCCGTCGGAATCGTACCGCGTGATCCCGGGGATGTTGAGCTGGTGTCAGACCAACTGTCTCCGCTATCCTCCCAACTGCCCCGACGCACTCTGCCACTGCCCACAAGTGTGCGAGGCGATAGGCGAGTTGGCTGGCAGGGAGGGTGCGGACGTGTATTGTATGGACCAATGCATCGTGTATCCGCCGCGTTGCCCCAAGAAACGCTGCTCATGCTACTAA

Protein sequence:

>DPOGS209626-PA
MEIRRRNGTSFLTYWIFLLQMLGSGRRVAEGHGRLMDPPARNSMWRFGFPNPVNYNDNELFCGGYAVQWEQNGGKCGVCGDAEHLSEPRPHEAGGMYGKGIITRHYSVGQEIEVEVELTANHLGAFVLKLCPNNNPNQEATQECFDRYPLFISGTREDRFLIPLDTAKKDTFRYRVRLPPYVTCTQCVLQWTYYTGNVACYNKLVKIKMQRNVDHVKCITGNMWGICPNGTEAVGCGRSETFRNCADVAVITNTGGLPPAFADGLRRDNPFLLYYRDYNMPQNVFPLVVSNDIDSETEEDISPFVIREQVCVPSESYRVIPGMLSWCQTNCLRYPPNCPDALCHCPQVCEAIGELAGREGADVYCMDQCIVYPPRCPKKRCSCY-