Monarch geneset OGS2.0

DPOGS202216
Transcript: DPOGS202216-TA (1026 bp)
Protein: DPOGS202216-PA (341 aa)
Genomic position: DPSCF300149, + strand, 228339-232534
RNAseq coverage: 471x (Rank: top 26%)
Annotation
Heliconius: HMEL009208 | 2e-155 | 84.12%
Bombyx: BGIBMGA013509-TA | 6e-147 | 80.99%
Drosophila: Vdup1-PD | 6e-131 | 65.06%
EBI UniRef50: UniRef50_Q0E8K8 | 9e-129 | 65.06% | Vitamin D3 up-regulated protein 1, isoform A n=36 Tax=Arthropoda RepID=Q0E8K8_DROME
NCBI RefSeq: XP_001844423.1 | 7e-139 | 69.43% | conserved hypothetical protein [Culex quinquefasciatus]
NCBI nr blastp: gi|170033112 | 1e-137 | 69.43% | conserved hypothetical protein [Culex quinquefasciatus]
NCBI nr blastx: gi|91092722 | 1e-138 | 70.76% | PREDICTED: similar to AGAP002691-PA [Tribolium castaneum]
Group
KEGG pathway 
InterPro domain:
[165-337] | IPR014756 | 1.6e-29 | Immunoglobulin E-set
[8-145] | IPR011021 | 1.4e-21 | Arrestin-like, N-terminal
[184-311] | IPR011022 | 1.2e-17 | Arrestin-like, C-terminal
Orthology group: MCL15624 (insect specific)
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS202216-TA
ATGCCTCGGAAACTGCTAAAATTTTTAATCGTCTTCGACAACACGTCGCTCTTATATTTTCCTGGTCAATTCTTATCAGGAAAAGTTCTGATGGAGTTACAAGACGACACTCCAGTACTTGGTCTACATTTCCATGTAATCGGTGAAGGTGTTGTAAGAGTTGGATCCGGCAGACATGAGAGGCTGTTTGACAAGGAAAACTATATTGACTTCAGAATGAGGCTTCTGGGTGAACCAGGTCATGGAGCATCCGTCCTTTCTCCGGGGATCCATAGCTTTCCATTTAAGTTGGGGCTTCCTATGGGCCTTCCGTCCACATTCCTCGGTACTCATGGATGGGTCCAGTACTATTGCAAGGCCGCACTAAGAGAACCAAATGGTCTCACTCATAAAAATCAGCAAGTCTTTATAGTAATGAATCCCATTGACCTGAATTCTGAACCGCCAGTCCTAGCTCGATGTGACCCTGTAGTTATTTCTATGTGGCGTGTGCAGCAAGAGTTTGAGCTGAGTGTGGAGCACAAGTTGGGTGTGGGGTGTGTCGGCGGCGGGGTGGTTCAGTGTCGCGTGTCCCTGGACCGGGGCGCGTACGTGCCCGGGGAAAGCGTCGCATTATCCGCCGTCGTCGACAATAGATCGCGGACACTCATCAAGGCTACCAGAGCGGCGTTGACAGAGACGATCCAGTACGTGGCTCACGGCAAGGTGGCGGCGCGGGAGGTGCGGGAGTTGGCGCGCGTGGCGCACGGCCGGGTGCGGGGCGGGCAGGCCCAGCGCTGGCGGGACGTGCTGTACGTGCCGCCGCTGCCACCCACCAACCTCAGAGGCTGTCATCTCATATCCGTGCAGTACGACGTCTTTTTCATCCTTGAACCGAAGAGTCTCGAAAAGGAAGTGAAACTCCAACTTCCGGTACTGCTAGGCACATACCCATTCAGAGACGACGACGCGGAACGCCCGCCCACGCACTACCCCACCACGCTGCCCATCTTCAGGCCCTGGCTGGCTGACAAACAGTCGCAGTGA

Protein sequence:

>DPOGS202216-PA
MPRKLLKFLIVFDNTSLLYFPGQFLSGKVLMELQDDTPVLGLHFHVIGEGVVRVGSGRHERLFDKENYIDFRMRLLGEPGHGASVLSPGIHSFPFKLGLPMGLPSTFLGTHGWVQYYCKAALREPNGLTHKNQQVFIVMNPIDLNSEPPVLARCDPVVISMWRVQQEFELSVEHKLGVGCVGGGVVQCRVSLDRGAYVPGESVALSAVVDNRSRTLIKATRAALTETIQYVAHGKVAAREVRELARVAHGRVRGGQAQRWRDVLYVPPLPPTNLRGCHLISVQYDVFFILEPKSLEKEVKLQLPVLLGTYPFRDDDAERPPTHYPTTLPIFRPWLADKQSQ-
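
The transcript and protein lengths in this record are related by simple arithmetic: 1026 bp is 342 codons, that is, 341 amino acids plus one stop codon (shown as "-" at the end of the protein sequence). The Python sketch below is one way to check this consistency. It assumes the standard nuclear genetic code (NCBI translation table 1), which the record does not state explicitly, and the function names `translate` and `expected_protein_length` are hypothetical helpers introduced only for illustration.

```python
# Standard genetic code (NCBI translation table 1); this choice is an assumption.
# Codons are enumerated with bases in the order T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(cds, stop="-"):
    """Translate a coding sequence codon by codon.

    Stop codons are written as `stop` ("-" by default, matching this record).
    Codons containing characters other than A, C, G, T raise KeyError.
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for pos in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[pos:pos + 3]]
        residues.append(stop if aa == "*" else aa)
    return "".join(residues)


def expected_protein_length(transcript_bp):
    """Amino acid count for a CDS of the given length that ends in one stop codon."""
    return transcript_bp // 3 - 1


# First and last codons of DPOGS202216-TA, copied from the nucleotide sequence.
print(translate("ATGCCTCGGAAACTGCTA"))   # MPRKLL
print(translate("AAACAGTCGCAGTGA"))      # KQSQ-
print(expected_protein_length(1026))     # 341
```

Passing the full DPOGS202216-TA sequence to `translate` and comparing the result with DPOGS202216-PA would extend the same check to the whole record.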