Monarch geneset OGS2.0

DPOGS207948
TranscriptDPOGS207948-TA5397 bp
ProteinDPOGS207948-PA1798 aa
Genomic positionDPSCF300090 - 134588-152786
RNAseq coverage100x (Rank: top 61%)
Annotation
HeliconiusHMEL0221932e-7878.12% 
BombyxBGIBMGA014040-TA2e-2733.91% 
DrosophilaCG10625-PH1e-3754.81% 
EBI UniRef50UniRef50_E2BQW15e-8836.88%Putative uncharacterized protein n=2 Tax=Coelomata RepID=E2BQW1_HARSA
NCBI RefSeqNP_001165114.12e-6350.41%collagen [Bombyx mori]
NCBI nr blastpgi|3072025802e-8736.88%hypothetical protein EAI_14096 [Harpegnathos saltator]
NCBI nr blastxgi|2912258620.047.14%PREDICTED: zonadhesin-like, partial [Saccoglossus kowalevskii]
Group
Gene OntologyGO:00423024e-07structural constituent of cuticle
KEGG pathwaycin:1001867753e-14 
 K06237 (COL4A)maps-> Small cell lung cancer
    Pathways in cancer
    Amoebiasis
    Focal adhesion
    ECM-receptor interaction
InterPro domain[71-119] IPR0006184e-07Insect cuticle protein
Orthology groupMCL18454 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS207948-TA
ATGGCTCCGGGACTCCTCTTTTATTTTCTGGTGTCTCTGGTAGTAGCGCATGCTACCGAAGATGCTAATGAAACGAGAGCATTGATTCAAGAAGACGCCCACGAGGCTCGAGATTACGGAACGTATGGCGATAACAGTAAGGAGACCGTAGTTAATATTGAGGACGATGAAAAAACACAGTACTATGAAACCAACTACGACACTAGTGCATACGGATTCGGTTACGATGTAGGCCCCAACGGTCAATTTCACCATGAAAATAAAGGCCCCGATGGTGTGACTTACGGTTGCTACGGCTACGTTGACCCCGACGGTTACCTTCGCGTCACACACTACGTCGCTGATAGCCACGGCTACAGAATTATAGAACCCGAAAAACCTGTGGAAGTTTTCCCAGAGGAAAACCACGAATACGATGAAAATTTGGTGACTCCGAGTCCTCTTCCTGGTCAGATAGTACCATGGAAGAAGCTATACATGCCACGAGGATGTGGTAAAACTCCTGGTGGAATTCCTCCTCGCCCTCTACCAAAACCGAAACCGACAAGCCCACCTCGTCCACCACCAGATAGCGCTGGACAAAACAGCAACCCAAAACCTGGTGTTGTGTATCCAGGAGGACAAGGTGGTTACTATCCTGGCACGCCTGGTACCTCTGGTTCCCCTGGTACACCCGGCAGCCCGGGTATACCTGGCAGACCCGGTAGCCCTGGAACCCCAGGCAGTCCTGGCGGACCCGGAGGTCCATCCGGACCATCTGGCCCAGGAGGACAAAATTCAGGCTATTACCCCGGTCAAGGACAAGGTGGCTCTTACCCCGTGAGACCTGGAGCACCCGGAAGCTCTGGAGCACCCGGAAGCCCTGGAGCACCAGGAAGCCCCGGAGCACCTGGCGGTCCTGGCGGACCCGGTGGACCCGGAGGTCCATCCGGACCATCTGGCCCAGGAGGACAAAACTCAGGCTATTACCCAGGTCAAGGACAAGGTGGCTCTTACCCCGTGAGACCTGGAGCACCCGGAAGCTCTGGAGCACCCGGAAGCCCTGGAGCACCAGGAAGCCCCGGAGCACCTGGCGGTCCTGGCGGACCCGGTGGACCCGGAGGTCCATCCGGACCATCTGGCCCAGGAGGACAAAACTCAGGCTATTACCCAGGTCAAGGACAAGGTGGCTCTTACCCCGTGAGACCTGGAGCACCCGGAAGCTCTGGAGCACCCGGAAGCCCTGGAGCACCAGGAAGCCCCGGAGCACCTGGCGGTCCTGGCGGACCCGGTGGACCCGGAGGTCCATCCGGACCATCTGGCCCAGGAGGACAAAACTCAGGCTATTACCCCGGTCAAGGACAAGGTGGCTCTTACCCCGTGAGACCTGGAGCACCCGGAAGCTCTGGAGCACCCGGAAGCCCTGGAGCACCAGGAAGCCCCGGAGCACCTGGCGGTCCTGGCGGACCCGGTGGACCCGGAGGTCCATCCGGACCATCTGGCCCAGGAGGACAAAACTCAGGCTATTACCCCGGTCAAGGACAAGGTGGATCTTACCCGGTTAGACCAGGAGCACCCGGTAGCCCGGGAGCACCAGGTGGTCCCGGTGGACCCGGCGGTCCCGGTGGTCCCGGAGGACCAGCAGGACCAAGTGGCCCTAGCACACCTCCATCGCAAGGACCTGGTAGCGGATATTATCCCGGACAAGGACAAGGTGGGTACTACCCTAGCGGTCCTGGATCTCCCGGTCAACCGGGAAGTCCTGGCAGCCCCGGTAGCCCTGGCAGCCCTGGCAGCCCCGGTAGCCCTGGATCACCAGGTGGACCAGGTGGATCTTATGTACCCAGTGGACCTAGTGGACCGGTCGGACCCAATGGCCCGTATGGACCCAATAGACCGAGTGGACCAAATCAACCATCAGGCCCCGGTGGACAAATTGGACCCGAACAAGGTGGATCTTACCCCGTCAGACCTGGTGCACCTGGTAGCCCTGGAGCACCTGGTGGACCCGGTGGACCAGGAGGGCCTGGCGGTCCCGGTGGACCAGCTGGACCAAATGGACCCGGCGGACCCAATGGACCGAATAGACCCAATGGACCCAATGGACCGAGTGGGCCAAATGGACCAAGTGGACCTAATCAACCTTTAGGACCTGGTGGACAAACTGGACCTGGACAAGGTGGATCTTACCCGGTTAGACCAGGAGCACCCGGTAGCCCGGGAGCACCAGGTGGTCCCGGTGGACCCGGCGGTCCCGGTGGTCCCGGAGGACCAGCAGGACCAAGTGGCCCTAGCACACCTCCATCGCAAGGACCTGGTAGCGGATATTATCCCGGACAAGGACAAGGTGGGTACTACCCTAGCGGTCCTGGATCTCCCGGTCAACCGGGAAGTCCTGGCAGCCCCGGTAGCCCTGGCAGCCCTGGCAGCCCCGGTAGCCCTGGATCACCAGGTGGACCAGGTGGATCTTATGTACCCAGTGGACCTAGTGGACCGGTCGGACCCAATGGCCCGTATGGACCCAATAGACCGAGTGGACCAAATCAACCATCAGGCCCCGGTGGACAAATTGGACCCGAACAAGGTGGATCTTACCCCGTCAGACCTGGTGCACCTGGTAGCCCTGGAGCACCTGGTGGACCCGGTGGACCAGGAGGGCCTGGCGGTCCCGGTGGACCAGCTGGACCAAATGGACCCGGCGGACCCAATGGACCGAATAGACCCAATGGACCCAATGGACCGAGTGGGCCAAATGGACCAAGTGGACCTAATCAACCTTTAGGACCTGGTGGACAAACTGGACCTGGACAAGGTGGATCTTACCCGGTTAGACCAGGAGCACCCGGTAGCCCGGGAGCACCAGGTGGTCCCGGTGGACCCGGCGGTCCCGGTGGTCCCGGAGGACCAGCAGGACCAAGTGGCCCTAGCACACCTCCATCGCAAGGACCTGGTAGCGGATATTATCCCGGACAAGGACAAGGTGGGTACTACCCTAGCGGTCCTGGATCTCCCGGTCAACCGGGAAGTCCTGGCAGCCCCGGTAGCCCTGGCAGCCCTGGCAGCCCCGGTAGCCCTGGATCACCAGGTGGACCAGGTGGATCTTATGTACCCAGTGGACCTAGTGGACCGGTCGGACCCAATGGCCCGTATGGACCCAATAGACCGAGTGGACCAAATCAACCATCAGGCCCCGGTGGACAAATTGGACCCGAACAAGGTGGATCTTACCCCGTCAGACCTGGTGCACCTGGTAGCCCTGGAGCACCTGGTGGACCCGGTGGACCAGGAGGGCCTGGCGGTCCCGGTGGACCAGCTGGACCAAATGGACCCGGCGGACCCAATGGACCGAATAGACCCAATGGACCCAATGGACCGAGTGGGCCAAATGGACCAAGTGGACCTAATCAACCTTTAGGACCTGGTGGACAAACTGGACCTGGACAAGGTGGATCTTACCCGGTTAGACCAGGAGCACCCGGTAGCCCGGGAGCACCAGGTGGTCCCGGTGGACCCGGCGGTCCCGGTGGTCCCGGAGGACCAGCAGGACCAAGTGGCCCTAGCACACCTCCATCGCAAGGACCTGGTAGCGGATATTATCCCGGACAAGGACAAGGTGGGTACTACCCTAGCGGTCCTGGATCTCCCGGTCAACCGGGAAGTCCTGGCAGCCCCGGTAGCCCTGGCAGCCCTGGCAGCCCCGGTAGCCCTGGATCACCAGGTGGACCAGGTGGATCTTATGTACCCAGTGGACCTAGTGGACCGGTCGGACCCAATGGCCCGTATGGACCCAATAGACCGAGTGGACCAAATCAACCATCAGGCCCCGGTGGACAAATTGGACCCGAACAAGGTGGATCTTACCCCGTCAGACCTGGTGCACCTGGTAGCCCTGGAGCACCTGGTGGACCCGGTGGACCAGGAGGGCCTGGCGGTCCCGGTGGACCAGCTGGACCAAATGGACCCGGCGGACCCAATGGACCGAATAGACCCAATGGACCCAATGGACCGAGTGGGCCAAATGGACCAAGTGGACCTAATCAACCTTTAGGACCTGGTGGACAAACTGGACCTGGACAAGGTGGATCTTACCCGGTTAGACCAGGAGCACCCGGTAGCCCGGGAGCACCAGGTGGTCCCGGTGGACCCGGCGGTCCCGGTGGTCCCGGAGGACCAGCAGGACCAAGTGGCCCTAGCACACCTCCATCGCAAGGACCTGGTAGCGGATATTATCCCGGACAAGGACAAGGTGGGTACTACCCTAGCGGTCCTGGATCTCCCGGTCAACCGGGAAGTCCTGGCAGCCCCGGTAGCCCTGGCAGCCCTGGCAGCCCCGGTAGCCCTGGATCACCAGGTGGACCAGGTGGATCTTATGTACCCAGTGGACCTAGTGGACCGGTCGGACCCAATGGCCCGTATGGACCCAATAGACCGAGTGGACCAAATCAACCATCAGGCCCCGGTGGACAAATTGGACCCGAACAAGGTGGATCTTACCCCGTCAGACCTGGTGCACCTGGTAGCCCTGGAGCACCTGGTGGACCCGGTGGACCAGGAGGGCCTGGCGGTCCCGGTGGACCAGCTGGACCAAATGGACCCGGCGGACCCAATGGACCGAACATACCCAATGGACCCAATGGACCGAGTGGGCCAAATGGACCAAGTGGACCTAATCAACCGTTAGGACCTGGTGGACAAACTGGACCTGGACAAGGTGGTTCTTACCCGGTTAGCCCAGGAGCACCCGGTAGCCCAGGAGCACCAGGTGGTCCCGGTGGACCCGGCGGTCCCGGTGGTCCCGGAGGACCTGGTGGACCAAATGGACCCGGTCAAGTTCCCGATTCCGTCGGTTCACCTTCCGGAACAGGTGTTCAGCCACCAGTTTACCCCCCGCCCAGTCAGCAGCCGCCATTCCCTATATATGTTATACCATATCCATTGCCGATCGTGCCAAGCCCCGCATCATGTCCCTGTTATCTCTTGAATCCGGGCCAAAATAATCAACAATCTTCTCCACAAATGCAGTATAACCAATACCCTTACCAAGGGTACCAACCTTATGGCATTATAGGGTTTATACCAGTCGTATTCGTTCCCAACTGTCCTGGAAATAATACTGGTATGCAAACTGCGCAACAAAACTTCCCTAATGCTGTATCTGTTCCCTATAATTGTGGCCAATGTCAAGCGTCGAATGACATTTACCGGTACTTCGGAAGATTAAATGGAGGACGTAGCATTGAAATGAACGACTTAAAAGAAATCAAATCTCTACCAGAACTGGAGAATCTCTTGAAGAATCAAATTAAACCTCCAAGAAAGAGTTTAAGGAGGATAGCCGTGAATGCCAGAGTTCTGGACGACATGACGAACGACAAGAAAAATAAAAAGAATTTGATAATTAAGGCGAAAGAAGATTAA

Protein sequence:

>DPOGS207948-PA
MAPGLLFYFLVSLVVAHATEDANETRALIQEDAHEARDYGTYGDNSKETVVNIEDDEKTQYYETNYDTSAYGFGYDVGPNGQFHHENKGPDGVTYGCYGYVDPDGYLRVTHYVADSHGYRIIEPEKPVEVFPEENHEYDENLVTPSPLPGQIVPWKKLYMPRGCGKTPGGIPPRPLPKPKPTSPPRPPPDSAGQNSNPKPGVVYPGGQGGYYPGTPGTSGSPGTPGSPGIPGRPGSPGTPGSPGGPGGPSGPSGPGGQNSGYYPGQGQGGSYPVRPGAPGSSGAPGSPGAPGSPGAPGGPGGPGGPGGPSGPSGPGGQNSGYYPGQGQGGSYPVRPGAPGSSGAPGSPGAPGSPGAPGGPGGPGGPGGPSGPSGPGGQNSGYYPGQGQGGSYPVRPGAPGSSGAPGSPGAPGSPGAPGGPGGPGGPGGPSGPSGPGGQNSGYYPGQGQGGSYPVRPGAPGSSGAPGSPGAPGSPGAPGGPGGPGGPGGPSGPSGPGGQNSGYYPGQGQGGSYPVRPGAPGSPGAPGGPGGPGGPGGPGGPAGPSGPSTPPSQGPGSGYYPGQGQGGYYPSGPGSPGQPGSPGSPGSPGSPGSPGSPGSPGGPGGSYVPSGPSGPVGPNGPYGPNRPSGPNQPSGPGGQIGPEQGGSYPVRPGAPGSPGAPGGPGGPGGPGGPGGPAGPNGPGGPNGPNRPNGPNGPSGPNGPSGPNQPLGPGGQTGPGQGGSYPVRPGAPGSPGAPGGPGGPGGPGGPGGPAGPSGPSTPPSQGPGSGYYPGQGQGGYYPSGPGSPGQPGSPGSPGSPGSPGSPGSPGSPGGPGGSYVPSGPSGPVGPNGPYGPNRPSGPNQPSGPGGQIGPEQGGSYPVRPGAPGSPGAPGGPGGPGGPGGPGGPAGPNGPGGPNGPNRPNGPNGPSGPNGPSGPNQPLGPGGQTGPGQGGSYPVRPGAPGSPGAPGGPGGPGGPGGPGGPAGPSGPSTPPSQGPGSGYYPGQGQGGYYPSGPGSPGQPGSPGSPGSPGSPGSPGSPGSPGGPGGSYVPSGPSGPVGPNGPYGPNRPSGPNQPSGPGGQIGPEQGGSYPVRPGAPGSPGAPGGPGGPGGPGGPGGPAGPNGPGGPNGPNRPNGPNGPSGPNGPSGPNQPLGPGGQTGPGQGGSYPVRPGAPGSPGAPGGPGGPGGPGGPGGPAGPSGPSTPPSQGPGSGYYPGQGQGGYYPSGPGSPGQPGSPGSPGSPGSPGSPGSPGSPGGPGGSYVPSGPSGPVGPNGPYGPNRPSGPNQPSGPGGQIGPEQGGSYPVRPGAPGSPGAPGGPGGPGGPGGPGGPAGPNGPGGPNGPNRPNGPNGPSGPNGPSGPNQPLGPGGQTGPGQGGSYPVRPGAPGSPGAPGGPGGPGGPGGPGGPAGPSGPSTPPSQGPGSGYYPGQGQGGYYPSGPGSPGQPGSPGSPGSPGSPGSPGSPGSPGGPGGSYVPSGPSGPVGPNGPYGPNRPSGPNQPSGPGGQIGPEQGGSYPVRPGAPGSPGAPGGPGGPGGPGGPGGPAGPNGPGGPNGPNIPNGPNGPSGPNGPSGPNQPLGPGGQTGPGQGGSYPVSPGAPGSPGAPGGPGGPGGPGGPGGPGGPNGPGQVPDSVGSPSGTGVQPPVYPPPSQQPPFPIYVIPYPLPIVPSPASCPCYLLNPGQNNQQSSPQMQYNQYPYQGYQPYGIIGFIPVVFVPNCPGNNTGMQTAQQNFPNAVSVPYNCGQCQASNDIYRYFGRLNGGRSIEMNDLKEIKSLPELENLLKNQIKPPRKSLRRIAVNARVLDDMTNDKKNKKNLIIKAKED-