Monarch geneset OGS2.0

DPOGS200980
TranscriptDPOGS200980-TA663 bp
ProteinDPOGS200980-PA220 aa
Genomic positionDPSCF300147 - 487416-488839
RNAseq coverage8552x (Rank: top 2%)
Annotation
HeliconiusHMEL0023782e-6255.45% 
BombyxBGIBMGA009047-TA3e-3041.44% 
DrosophilaCG32354-PA3e-1332.84% 
EBI UniRef50UniRef50_F0VGR83e-2837.50%Serine protease inhibitor dipetalogastin n=4 Tax=Sarcocystidae RepID=F0VGR8_NEOCL
NCBI RefSeqXP_001630534.11e-2237.24%predicted protein [Nematostella vectensis]
NCBI nr blastpgi|2953155361e-2940.29%Kazal-type inhibitor [Panstrongylus megistus]
NCBI nr blastxgi|349214265e-3741.84%RecName: Full=Serine protease inhibitor dipetalogastin; Short=Dipetalin; Flags: Precursor
Group
Gene OntologyGO:00055154.3e-13protein binding
KEGG pathwaydre:5653736e-17 
 K06254 (AGRN)maps-> ECM-receptor interaction
InterPro domain[23-65] IPR0023504.3e-13Proteinase inhibitor I1, Kazal
[74-110] IPR0114977.1e-11Protease inhibitor, Kazal-type
Orthology groupMCL25568 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS200980-TA
ATGATTCATCGAGGCTACATGCTCTTCCTCGTGGGCACGTTCGCGTCAATGACGTCAGCTCTGCCGCCGTGCGTGTGTCCGAAGAACATGAAACCGGTATGCGGTTCCGACGGCCAGACTTACAACAACGAATGTCTCCTGAACTGCCAGAAGATTGATAACCCTGACCTCGTCGTGGATAAAGTCGGAAGCTGCGAGCAAAAATCCGGTGGATGTTTTTGTACATTCGAATATTCGCCGATCTGCGGTAGCGATGGCGTGACCTATGCGAATCAATGCGAATTTGACTGCGAGGCGGGTGACGCAAAACTAATGTACAGGGGCGAGTGCAGGGCGAAACGGGAAGCGCCGCTAGTAGTGCAGATACCGGACTGTTCGTGTTCTAGAGAGGCCAAACCCGTCTGTGGAACTGATGGACATACATACAACAACCCGTGTATGCTGAACTGCGCCAAAGATGTCTTAGAAGACCTCCACGTTTTCCACGAGGGACCCTGCATGATTGAGGGCAAAAAATTTGATCCCGAAGTCCGTAACTGCGGGTGCACCAGAAACCTTCAGCCAGTGTGTGCTTCTGACGGCGTCACATACAATAATGAATGCCTCATGAGGTGCGCCGGCGAGGACCTTGTGGTACAGAAGGACGAACCCTGTGATGATTAA

Protein sequence:

>DPOGS200980-PA
MIHRGYMLFLVGTFASMTSALPPCVCPKNMKPVCGSDGQTYNNECLLNCQKIDNPDLVVDKVGSCEQKSGGCFCTFEYSPICGSDGVTYANQCEFDCEAGDAKLMYRGECRAKREAPLVVQIPDCSCSREAKPVCGTDGHTYNNPCMLNCAKDVLEDLHVFHEGPCMIEGKKFDPEVRNCGCTRNLQPVCASDGVTYNNECLMRCAGEDLVVQKDEPCDD-