Monarch geneset OGS2.0

DPOGS216020
TranscriptDPOGS216020-TA2304 bp
ProteinDPOGS216020-PA767 aa
Genomic positionDPSCF300078 + 827092-836528
RNAseq coverage1826x (Rank: top 7%)
Annotation
HeliconiusHMEL0164480.066.80% 
BombyxBGIBMGA001091-TA0.070.49% 
Drosophilafat-spondin-PA0.051.09% 
EBI UniRef50UniRef50_Q7KN040.051.09%Fat-spondin n=24 Tax=Endopterygota RepID=Q7KN04_DROME
NCBI RefSeqXP_975464.20.053.58%PREDICTED: similar to f-spondin [Tribolium castaneum]
NCBI nr blastpgi|1892365260.053.58%PREDICTED: similar to f-spondin [Tribolium castaneum]
NCBI nr blastxgi|1892365260.053.97%PREDICTED: similar to f-spondin [Tribolium castaneum]
Group
Gene OntologyGO:00048677.8e-21serine-type endopeptidase inhibitor activity
KEGG pathwaygga:4148372e-12 
 K04659 (THBS)maps-> Malaria
    TGF-beta signaling pathway
    Focal adhesion
    Phagosome
    ECM-receptor interaction
InterPro domain[613-666] IPR0022237.8e-21Proteinase inhibitor I2, Kunitz metazoa
[712-767] IPR0008841.9e-13Thrombospondin, type 1 repeat
[18-151] IPR0028611e-12Reeler domain
Orthology groupMCL14336 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS216020-TA
ATGTGGTGGACTGTTGGAGCGTGGTGTTTGATAGTGGTCTGTGCGAAGGCTTGTGACTTACATCCAGGGGAAGGTGCTGGGGAGAAATCACCGGGGGACAATTTCTATCGAGTCATCATTAATGGAGACGTAGAAAGATATACTCCGGAACAAAGATATGTTGTAACTTTGGTTGGTTCGCGAACTCACGACGTGGTACAACAGTTTACCAGGTTCCAGCTGATACTGGATCCACTAGATCCATCATCTCCACGGACTCCGCGGAAACAGGGACAGTTCCAGCTATTCGCGGACACGCTAAGCAAGTTCGACGAGGAATGTACTAATTCCGTCATAGAAGCTGATGATTTGCCTAAAACCGAAGTGGTCCAATACACCTTTCGAGGTGGAGCAGGTTCAGGATGTGTCCTGATAAGGGCAATGGTTTACGAAAATGACACTCGGTGGTTCGCAGCTGACGGGCAACTGACGCATCGCATATGTGAAGAATCGCCCAGTATAGATTGCTGTGCTTGCGATGACGCAAAGTATAGTATGGTGTTTGAGGGTCTGTGGTCCCCGCAGACTCATCCTAAGGACTTTCCCGTGCAGGCTCTCTGGCTGACACATTTCTCCGACGTCATCGGTGCGACTCACGTCAAGAACTTTTCTTTTTGGGGAGAAGGGGAAATTGCTACTGATGGCTTCAGGTCTTTAGCAGAGTGGGGTTCACCAGGTCTATTGGAGCGTGAACTGCATCAACATGGTCCAGCATATCTGCGTAGCGTAGTGAAGGCTGCCGGTCTTTGGCATCCCAGGCTAAACTTGAACACGTCAGCTACTTTTACTGTAGATCGCAAAAGACATTTACTGTCACTTGCATCTATGTTTGGACCTTCTCCTGATTGGGTGGTGGGAGTGAGTGGACTTGATCTTTGCCAAAAGGACTGTACATGGACTGAAAATAAGGTCATTGATCTATTTCCATATGACGCTGGTACTGATAACGGTATCACATACATGTCTCCAAATTCAGAAACGGTTCCTCGAGAAAAAATGTATCGGATCACGACTATGTATCCTGAGGATCCAAGGGCCCCTTTTTACAATCCCGCTAGCGATAGCATGAACCCAATGGCAAGGTTATACTTGAAGCGGGAAAGTCTTATTTCAAGAGCATGTGACCAGGAAGTACTTCAGAGTCTTGTGGTCGAAGAACAAGAAAATACTCGCTCGGTCGACATACCCCAGTGTGCGGTAACTGAGTGGAGCAAGTGGTCTCCTTGCTCGGTGTCGTGTGGTAAGGGGCTACGTATGCGAACTCGCGAATATCGTATGCCACAGAAGGCGCAGATGTTCCAATGCGACCGTCAGCTTGTGTCCAAGGAAATGTGTGTCGCGGATATTCCCGAATGTCCTGATAGTGAAGACCCGATACCTGACTCTAACTCGGAGTCTGGTACCGACCTTCCAGTTTGCCGCACGACTGAATGGGGTCCTTGGGGCGAGTGCTCCGCGACATGTGGCGTTGGAATTGCTACGAGGCGACGAACATTTATTGATCACATGGGATACAAAAAATGTCCCCTAGTGAACACCGAGGAAAACCGCAAATGCATGGAGCCGCCCTGCCCTGAGGGAGAGGTTCAAGAGGTTTCGGACCCCCAATGTCCCACTAGTCCGTGGGCATCGTGGTCTCCTTGCTCAGCGTCATGTGGACGTGGTGTGGCATTCCGTACCCGTCTGTTGCTGGTGCCAGCAGACCGTCAACAAGAATGCAGCTCGCGAGTTGAACTCATGCAACAACGTCCGTGTTCCGAAAGAGAGGATTGTACAATCGACATGATTACAGCTAAGCGTATCTGTATGGAGGTTCCCGACCCGGGTCCTTGTCGTGGAGTGTACTCCCGCTGGGCGTTTTCGACTCTCAAGGGCATGTGCGTGCCTTTTAGCTACGGTGGTTGTCGTGGAAATAAGAACAACTTTATTTCTCAGGAAGACTGCACTAATACATGCTCTGTGCTAACTGGCGGGGCTCCGAGCGCGTCAGTACCTTCCCCCACGGGCGTGGGCGCGGGATTATTGCCTGTGGTCAGTTCTAACTACCCTGGCCTTAGCGTTAGTTCCATAGTTCCTGTGCCACCCGCCGTCAGCGCTAATTCTGGTGATTGCGTGGTGAGTTCCTGGGGCGACTGGAGCCGGTGTAGTGTGACCTGCGGCGTGGGTTACCAGGAACGCACTCGCACCATAGTGAAACCGGCGGCTGGAGGCGCCTCGTGCCCATCAAGACTTGTGCGCAGGCGTCGCTGTTCAAGAGCGTGCTAA

Protein sequence:

>DPOGS216020-PA
MWWTVGAWCLIVVCAKACDLHPGEGAGEKSPGDNFYRVIINGDVERYTPEQRYVVTLVGSRTHDVVQQFTRFQLILDPLDPSSPRTPRKQGQFQLFADTLSKFDEECTNSVIEADDLPKTEVVQYTFRGGAGSGCVLIRAMVYENDTRWFAADGQLTHRICEESPSIDCCACDDAKYSMVFEGLWSPQTHPKDFPVQALWLTHFSDVIGATHVKNFSFWGEGEIATDGFRSLAEWGSPGLLERELHQHGPAYLRSVVKAAGLWHPRLNLNTSATFTVDRKRHLLSLASMFGPSPDWVVGVSGLDLCQKDCTWTENKVIDLFPYDAGTDNGITYMSPNSETVPREKMYRITTMYPEDPRAPFYNPASDSMNPMARLYLKRESLISRACDQEVLQSLVVEEQENTRSVDIPQCAVTEWSKWSPCSVSCGKGLRMRTREYRMPQKAQMFQCDRQLVSKEMCVADIPECPDSEDPIPDSNSESGTDLPVCRTTEWGPWGECSATCGVGIATRRRTFIDHMGYKKCPLVNTEENRKCMEPPCPEGEVQEVSDPQCPTSPWASWSPCSASCGRGVAFRTRLLLVPADRQQECSSRVELMQQRPCSEREDCTIDMITAKRICMEVPDPGPCRGVYSRWAFSTLKGMCVPFSYGGCRGNKNNFISQEDCTNTCSVLTGGAPSASVPSPTGVGAGLLPVVSSNYPGLSVSSIVPVPPAVSANSGDCVVSSWGDWSRCSVTCGVGYQERTRTIVKPAAGGASCPSRLVRRRRCSRAC-