Monarch geneset OGS2.0

DPOGS209122
TranscriptDPOGS209122-TA4971 bp
ProteinDPOGS209122-PA1656 aa
Genomic positionDPSCF300440 - 50778-56063
RNAseq coverage52x (Rank: top 70%)
Annotation
HeliconiusHMEL0084402e-1429.10% 
Bombyx% 
DrosophilaMuc11A-PA6e-0636.36% 
EBI UniRef50UniRef50_P327688e-0839.86%Flocculation protein FLO1 n=39 Tax=Saccharomyces RepID=FLO1_YEAST
NCBI RefSeq%
NCBI nr blastp%
NCBI nr blastxgi|2211101890.035.25%PREDICTED: hypothetical protein [Hydra magnipapillata]
Group
Gene OntologyGO:00080619.2e-11chitin binding
GO:00060309.2e-11chitin metabolic process
GO:00055769.2e-11extracellular region
KEGG pathway 
InterPro domain[1560-1620] IPR0025579.2e-11Chitin binding domain
Orthology group 
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS209122-TA
ATGTTCAAGCCATATCTCCAGACTTGCCCGACAGGATTGCTCTACGATACTAAAACTAATAACTGCCAGCCATGGCCAGTAGCAATGTGCTGCAACTATGTATCTCCACGTCCATTTCATCCTTTACCTCCTGTATCTCCATTGGAACCTGGTAACACAACTAAAATACGAACTCTACCTCCTATCTCAATCACAACTCCATCAACTTTAACGGACACACCATCCGTTCCAACCGACGTAACGCATCCAACGAGTGAAACGCCCGTGATAACAACATCACATCCATTGCCGACGACAAGTTCTAGCCCCAATACTCCATCAACTTTAACGGACACTCCATCCGTTCCAACCGATATAACGCATCCAACGAGTGAAACGCCCGTGATAACAACATCACATCCATTGCCGACGACAAGTTCTAGCCCCAATACTCCATCAACTTTAACGGACACTCCATCCGTTCCAACCGATATAACGCATCCAACGAGTGAAACGCCCGTGATAACAACATCACATCCATTGCCGACGACAAGTTCTAGCCCCAATACTCCATCAACTTTAACGGACACTCCATCCGTTCCAACCGATATAACGCATCCAACGAGTGAAACGCCCGTGATAACAACATCACATCCATTGCCGACGACAAGTTCTAGCCCCAATACTCCATCAACTTTAACGGACACTCCATCCGTTCCAACCGATATAACGCATCCAACGAGTGAAACGCCCGTGATAACAACATCACATCCATTGCCGACGACAAGTTCTAGCCCCAATACTCCATCAACTTTAACGGACACTCCATCCGTTCCAACCGATATAACGCATCCAACGAGTGAAACGCCCGTGATAACAACATCACATCCATTGCCGACGACAAGTTCTAGCCCCAATACTCCATCAACTTTAACGGACACTCCATCCGTTCCAACCGATATAACGCATCCAACGAGTGAAACGCCCGTGATAACAACATCACATCCATTGCCGACGACAAGTTCTAGCCCCAATACTCCATCAACTTTAACGGACACTCCATCCGTTCCAACCGATATAACGCATCCAACGAGTGAAACGCCCGTGATAACAACATCACATCCATTGCCGACGACAAGTTCTAGCCCCAATACTCCATCAACTTTAACGGACACTCCATCCGTTCCAACCGATATAACGCATCCAACGAGTGAAACGCCCGTGATAACAACATCACATCCATTGCCGACGACAAGTTCTAGCCCCAATACTCCATCAACTTTAACGGACACTCCATCCGTTCCAACCGATATAACGCATCCAACGAGTGAAACGCCCGTGATAACAACATCACATCCATTGCCGACGACAAGTTCTAGCCCCAATACTCCATCAACTTTAACGGACACTCCATCCGTTCCAACCGATATAACGCATCCAACGAGTGAAACGCCCGTGATAACAACATCACATCCATTGCCGACGACAAGTTCTAGCCCCAATACTCCATCAACTTTAACGGACACTCCATCCGTTCCAACCGATATAACGCATCCAACGAGTGAAACGCCCGTGATAACAACATCACATCCATTGCCGACGACAAGTTCTAGCCCCAATACTCCATCAACTTTAACGGACACTCCATCCGTTCCAACCGATATAACGCATCCAACGAGTGAAACGCCCGTGATAACAACATCACATCCATTGCCGACGACAAGTTCTAGCCCCAATACTCCATCAACTTTAACGGACACTCCATCCGTTCCAACCGATATAACGCATCCAACGAGTGAAACGCCCGTGATAACAACATCACATCCATTGCCGACGACAAGTTCTAGCCCCAATACTCCATCAACTTTAACGGACACTCCATCCGTTCCAACCGATATAACGCATCCAACGAGTGAAACGCCCGTGATAACAACATCACATCCATTGCCGACGACAAGTTCTAGCCCCAATACTCCATCAACTTTAACGGACACTCCATCCGTTCCAACCGATATAACGCATCCAACGAGTGAAACGCCCGTGATAACAACATCACATCCATTGCCGACGACAAGTTCTAGCCCCAATACTCCATCAACTTTAACGGACACTCCATCCGTTCCAACCGATATAACGCATCCAACGAGTGAAACGCCCGTGATAACAACATCACATCCATTGCCGACGACAAGTTCTAGCCCCAATACTCCATCAACTTTAACGGACACTCCATCCGTTCCAACCGATATAACGCATCCAACGAGTGAAACGCCCGTGATAACAACATCACATCCATTGCCGACGACAAGTTCTAGCCCCAATACTCCATCAACTTTAACGGACACTCCATCCGTTCCAACCGATATAACGCATCCAACGAGTGAAACGCCCGTGATAACAACATCACATCCATTGCCGACGACAAGTTCTAGCCCCAATACTCCATCAACTTTAACGGACACTCCATCCGTTCCAACCGATATAACGCATCCAACGAGTGAAACGCCCGTGATAACAACATCACATCCATTGCCGACGACAAGTTCTAGCCCCAATACTCCATCAACTTTAACGGACACTCCATCCGTTCCAACCGATATAACGCATCCAACGAGTGAAACGCCCGTGATAACAACATCACATCCATTGCCGACGACAAGTTCTAGCCCCAATACTCCATCAACTTTAACGGACACTCCATCCGTTCCAACCGATATAACGCATCCAACGAGTGAAACGCCCGTGATAACAACATCACATCCATTGCCGACGACAAGTTCTAGCCCCAATACTCCATCAACTTTAACGGACACTCCATCCGTTCCAACCGATATAACGCATCCAACGAGTGAAACGCCCGTGATAACAACATCACATCCATTGCCGACGACAAGTTCTAGCCCCAATACTCCATCAACTTTAACGGACACTCCATCCGTTCCAACCGATATAACGCATCCAACGAGTGAAACGCCCGTGATAACAACATCACATCCATTGCCGACGACAAGTTCTAGCCCCAATACTCCATCAACTTTAACGGACACTCCATCCGTTCCAACCGATATAACGCATCCAACGAGTGAAACGCCCGTGATAACAACATCACATCCATTGCCGACGACAAGTTCTAGCCCCAATACTCCATCAACTTTAACGGACACTCCATCCGTTCCAACCGATATAACGCATCCAACGAGTGAAACGCCCGTGATAACAACATCACATCCATTGCCGACGACAAGTTCTAGCCCCAATACTCCATCAACTTTAACGGACACTCCATCCGTTCCAACCGATATAACGCATCCAACGAGTGAAACGCCCGTGATAACAACATCACATCCATTGCCGACGACAAGTTCTAGCCCCAATACTCCATCAACTTTAACGGACACTCCATCCGTTCCAACCGATATAACGCATCCAACGAGTGAAACGCCCGTGATAACAACATCACATCCATTGCCGACGACAAGTTCTAGCCCCAATACTCCATCAACTTTAACGGACACTCCATCCGTTCCAACCGATATAACGCATCCAACGAGTGAAACGCCCGTGATAACAACATCACATCCATTGCCGACGACAAGTTCTAGCCCCAATACTCCATCAACTTTAACGGACACTCCATCCGTTCCAACCGATATAACGCATCCAACGAGTGAAACGCCCGTGATAACAACATCACATCCATTGCCGACGACAAGTTCTAGCCCCAATACTCCATCAACTTTAACGGACACTCCATCCGTTCCAACCGATATAACGCATCCAACGAGTGAAACGCCCGTGATAACAACATCACATCCATTGCCGACGACAAGTTCTAGCCCCAATACTCCATCAACTTTAACGGACACTCCATCCGTTCCAACCGATATAACGCATCCAACGAGTGAAACGCCCGTGATAACAACATCACATCCATTGCCGACGACAAGTTCTAGCCCCAATACTCCATCAACTTTAACGGACACTCCATCCGTTCCAACCGATATAACGCATCCAACGAGTGAAACGCCCGTGATAACAACATCACATCCATTGCCGACGACAAGTTCTAGCCCCAATACTCCATCAACTTTAACGAGTGAAACGCCCGTGATAACAACTTCACATCCATTGCCGACAACAAGTTCTAGCCCCAATACTCCATCAACTTTAACGGACACTCCATCTGTTCCAACCGATGTAACGCATCCAACGAGTGAAACGCCCGTGATAACAACTTCACATCCATTGCCGACAACAAGTTCTAGCCCCAATACTCCATCAACTTTAACGGACACTCCATCTGTTCCAACCGACGTAACGCATCCAACGAGTGTAAATCCTACTCAAACAACCAAACCGAACATTAAAACCTTACCGCCCACTGCCACAACGACTACGAAACCCAGCACATCTACAAGTGAAACGCCTGGTTTACGTACCAATTCTGATAACATTACTGAATCTACTTTGACAGTTACGGCAGTTGAACATTCCACAACAACCCAAGCAACAATACCTCACTCTACTAATAAAACAACAGAGTCCAAAACTTATACTCCAAACAATGAACGTACAACAAGAGAAGTTAGTTCATTACGTACAGTAAGGACATTACCAACAAAGAAATCCAAAAGGACTCTTGCTCCTTATTCAACAACTACCAACAGCCCTTGTCATTTCACACCAATGATTACTGAAAAACCAATCAAACCTCAACTTCCCGAATGTGGTGAAGATGACAGATTCGTTTATCCCGACTTCGAAAACTGTAATTTTTACTTCAACTGCTTCAGTGGAGTCATGAAGAGACTTCCTTGCAAAATTAACTTTGCATTCAGTCCTCATGTATTGAGGTGCGTCCCTGTAGAGAAAGTGAACTGTAATTCATACAAATCAGCTCTAGAGCAGAGACTATATTCAATTAACTTCGTCCTCTTCAACGCTCTTTCGAAAGTATCGAACTCGCAATCCAAGAGAAAAAATGATGAATAA

Protein sequence:

>DPOGS209122-PA
MFKPYLQTCPTGLLYDTKTNNCQPWPVAMCCNYVSPRPFHPLPPVSPLEPGNTTKIRTLPPISITTPSTLTDTPSVPTDVTHPTSETPVITTSHPLPTTSSSPNTPSTLTDTPSVPTDITHPTSETPVITTSHPLPTTSSSPNTPSTLTDTPSVPTDITHPTSETPVITTSHPLPTTSSSPNTPSTLTDTPSVPTDITHPTSETPVITTSHPLPTTSSSPNTPSTLTDTPSVPTDITHPTSETPVITTSHPLPTTSSSPNTPSTLTDTPSVPTDITHPTSETPVITTSHPLPTTSSSPNTPSTLTDTPSVPTDITHPTSETPVITTSHPLPTTSSSPNTPSTLTDTPSVPTDITHPTSETPVITTSHPLPTTSSSPNTPSTLTDTPSVPTDITHPTSETPVITTSHPLPTTSSSPNTPSTLTDTPSVPTDITHPTSETPVITTSHPLPTTSSSPNTPSTLTDTPSVPTDITHPTSETPVITTSHPLPTTSSSPNTPSTLTDTPSVPTDITHPTSETPVITTSHPLPTTSSSPNTPSTLTDTPSVPTDITHPTSETPVITTSHPLPTTSSSPNTPSTLTDTPSVPTDITHPTSETPVITTSHPLPTTSSSPNTPSTLTDTPSVPTDITHPTSETPVITTSHPLPTTSSSPNTPSTLTDTPSVPTDITHPTSETPVITTSHPLPTTSSSPNTPSTLTDTPSVPTDITHPTSETPVITTSHPLPTTSSSPNTPSTLTDTPSVPTDITHPTSETPVITTSHPLPTTSSSPNTPSTLTDTPSVPTDITHPTSETPVITTSHPLPTTSSSPNTPSTLTDTPSVPTDITHPTSETPVITTSHPLPTTSSSPNTPSTLTDTPSVPTDITHPTSETPVITTSHPLPTTSSSPNTPSTLTDTPSVPTDITHPTSETPVITTSHPLPTTSSSPNTPSTLTDTPSVPTDITHPTSETPVITTSHPLPTTSSSPNTPSTLTDTPSVPTDITHPTSETPVITTSHPLPTTSSSPNTPSTLTDTPSVPTDITHPTSETPVITTSHPLPTTSSSPNTPSTLTDTPSVPTDITHPTSETPVITTSHPLPTTSSSPNTPSTLTDTPSVPTDITHPTSETPVITTSHPLPTTSSSPNTPSTLTDTPSVPTDITHPTSETPVITTSHPLPTTSSSPNTPSTLTDTPSVPTDITHPTSETPVITTSHPLPTTSSSPNTPSTLTDTPSVPTDITHPTSETPVITTSHPLPTTSSSPNTPSTLTDTPSVPTDITHPTSETPVITTSHPLPTTSSSPNTPSTLTDTPSVPTDITHPTSETPVITTSHPLPTTSSSPNTPSTLTDTPSVPTDITHPTSETPVITTSHPLPTTSSSPNTPSTLTSETPVITTSHPLPTTSSSPNTPSTLTDTPSVPTDVTHPTSETPVITTSHPLPTTSSSPNTPSTLTDTPSVPTDVTHPTSVNPTQTTKPNIKTLPPTATTTTKPSTSTSETPGLRTNSDNITESTLTVTAVEHSTTTQATIPHSTNKTTESKTYTPNNERTTREVSSLRTVRTLPTKKSKRTLAPYSTTTNSPCHFTPMITEKPIKPQLPECGEDDRFVYPDFENCNFYFNCFSGVMKRLPCKINFAFSPHVLRCVPVEKVNCNSYKSALEQRLYSINFVLFNALSKVSNSQSKRKNDE-