Monarch geneset OGS2.0

DPOGS205092
TranscriptDPOGS205092-TA2682 bp
ProteinDPOGS205092-PA893 aa
Genomic positionDPSCF300074 + 336204-354374
RNAseq coverage95x (Rank: top 62%)
Annotation
HeliconiusHMEL0033290.075.89% 
BombyxBGIBMGA006885-TA0.074.31% 
DrosophilaCG1632-PA5e-13139.85% 
EBI UniRef50UniRef50_D6X2045e-16543.24%Serine protease H129 n=11 Tax=Neoptera RepID=D6X204_TRICA
NCBI RefSeqXP_623899.27e-17542.73%PREDICTED: similar to CG1632-PA [Apis mellifera]
NCBI nr blastpgi|3504262573e-17843.73%PREDICTED: atrial natriuretic peptide-converting enzyme-like [Bombus impatiens]
NCBI nr blastxgi|3504262570.043.86%PREDICTED: atrial natriuretic peptide-converting enzyme-like [Bombus impatiens]
Group
Gene OntologyGO:00038241.2e-43catalytic activity
GO:00055154e-29protein binding
GO:00042522.8e-26serine-type endopeptidase activity
GO:00065082.8e-26proteolysis
KEGG pathway 
InterPro domain[652-889] IPR0090031.2e-43Peptidase cysteine/serine, trypsin-like
[335-453] IPR0200674e-29Frizzled domain
[672-884] IPR0012542.8e-26Peptidase S1/S6, chymotrypsin/Hap
[482-523] IPR0021721.3e-09Low-density lipoprotein (LDL) receptor class A repeat
Orthology groupMCL15588 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS205092-TA
ATGAAATTGCCGTTAGTTCGCCGTTTCCAAGTATCTTTTCCGTTTGGTTTAAATAAATATTTTGACCCGATCCGCGGAGGTTGTGGTGACTTTCGTTGGCGCGAAACTTATGGCTGTAAATGCGGCTTTGGTTTGGAGAAAGTTAACTGTAGAGTGCGTGCCCGTCTCCCGGCGTTCGAGACATTCAGACATCAGTCGACGCCGAGCTCGCCTCCGAGTGAAGCTTTGTATTGCTTTGTGAACGATTCAGTGTGCTATATACGGACTGAGCGGTACTTGAGAACAAAAATGACATTCTCTGGGAACATGAAAGAAAGCAAACCGCGAACCGACACATGGGAGACGGAGCTGGGATACGGCTGGTCTCGTAAGGAGCAACGTCGTTGTAGAGCACACAGGCCGCCGGAGTCCACGATGTCTGTCAGCAGTGACATCAGGTTTACTCGACGCAAACTAAGCAGACCTTGTCGGGGATGCTGTGCTGCTCTTGCGGCTCTACTCGTCCTATTGCTGCTCGGTGCTGTTGCTGTCTACCTCGGCCATATGTATCTGTTTGGAGATCCATTGAACCGGCAAACCTTCCGAGGGTCATTTGTGGTGTCGACATGGGATTCGAGTGACGAAATGCTATCAGACTCGAAAAACGGCACCCGCGAGATGGAAATGCGGCAGGTCCTTTATAACACATATAGAACCTCCGAGTTACATTCTTGTTTTGTCTCAGCTGAAATCTTAGCTTTAGACAGTGTGTCTGAAGGGACTAGGGTTCACTTCGAAGTATCCTTCGAGCCTATCTTCACAGCAGTCAGCACGCCGGATGTAGCCGCTGTTTTAGAACGGGAGCTTGAAACCCCTTTCTTTCTAAATATAGGAGTTTTAGCAGAAACTTTACATATTGAGGAAAGCTCGATTATATCTTCTCAAGCATTAAAAGAGGGTGATACAACAAGTACTGAGGCTCCCACCACAGAAGAAATAACATTAGAAATAGAAGAAGAGCTCCGGGAATGCAGTTCACTAAGCCTGCCTCTGTGTTCTCACCTACCCTATAATACGACTTCATATCCAAACCTAGTTGGACACACGTCTAAGGAAGTTCTACTTAGAGATCTTGTCGCATTCAGAGAACTTCTAGACGCGGAATGTTCCAATCTAGCGCAAGATTTCGTATGTCAAATGTTACAACCACGATGCGAGGACGATCATGTGATCCGTCCTTGCCGGGCGTACTGTCGCGCCTTTCACGCAGGTTGCGGGTCACGGCTCCCTGAACGACTGAGACCACACTTCGACTGCTCGAGATTCCCGGACTACTTCGGACCAGGGTCATGCCTCCCTGAACCGGATTGCTTGGGTGGTCTTCAACGCCTGGCTCTGTCTCGTCGTGCGTGCGATGCTGTCCCGGACTGTGCTGATGCGGTGGACGAGCGCTCTTGTTCTCATTGCTCGTCAGCAGGCCCGGGATCGTTACGCTGCGCCTTACAGCCTCGCTGTCTTCCACAGCACCTGCGCTGCGATGGAACCCCGGACTGCGCTGACGGCAGCGATGAAGCCGGCTGTTTGTGGGTAACCCGTTCGTTGTCGTCATGGCGTCGCGAGACTCGTGAGAGTACACTGGGCGCGATCCGTCACCGGGCAGGGTACGCGGTCTGGGCAGAGCGTGGCCGTGTCGGCAAGATCTGCGCGGAACCCTACGCACACGACCGGAAGGCGCTCATGAACGTAGCGACCTCCCTGTGCACCGCGCTAACATTCAAATCTGCAGTGTCAGCCGAAGCGGTGCCTGATGCTGAGGAAGAGCCTGATGAAAAAAACGAAAATACACAAACAGTATCCCAAAAAAATGAGCCGGAATATGTTGAAATAGTCGACCCCGCTGCAGCAGAAATATCGTTTGTCAAGAGCGACTGTCCTCAGAGGAAGGTTATCAAGATCATCTGTGATCAACTCGAATGTGGAATACCGTCCGCACGAGGCGCGGGCGCTACACTGGGTGTGCAGCATCTACCACGCTCGGCACGGCCCGGCGACTGGCCCTGGCACGCGGCTCTACTGCGTACGCACGTGCATGCATGTGACGCACTCCTCGTGCATGCTGCCTGGCTCGTTACCACCGCATCTTGCTTCCAAGGTCAACCGAAAGCGGAATGGACTGCTAGATTGGGAATTGTACGAATTCAAAGCACTTCACCCTGGCAACAAGAGAGGAGGATTGTAGGAATGGTGAAGTCACCGGTTGAAGGCAGTATGTTGGCGATGGTACGATTAGAAGAGCCAGTTGAAATCACAGACTTTGTCCGTCCTGCTTGCTTACCCATCGATGGCTTTAAAACGAATGAACATAGCATCTGCAATACCCTGGGATGGACGAGAAATAGGGATCAATTGCAAAGAGTTCACGTTGTCCCCACTACTATGAACACTTGCGAGAACGTTAGCATAGCAACTGGCAATGGCATATGTGCAGAACCATTGTATGATCAAGACGATTGCGACGAAGAAGAATATGCAGGCAGTGCAATGATGTGCTTTGACGAGAAATCGAAACACTGGTCTGTGATAGGAGTCAGCAGTTGGAGAATAGCGTGCTCAAAGATTGGATTGGGAAGGCCCAGGATATACGACGCCGTTACATCACATATCGACTGGATTCGAAGAACAATTGCAAACTCTTCAAGATGA

Protein sequence:

>DPOGS205092-PA
MKLPLVRRFQVSFPFGLNKYFDPIRGGCGDFRWRETYGCKCGFGLEKVNCRVRARLPAFETFRHQSTPSSPPSEALYCFVNDSVCYIRTERYLRTKMTFSGNMKESKPRTDTWETELGYGWSRKEQRRCRAHRPPESTMSVSSDIRFTRRKLSRPCRGCCAALAALLVLLLLGAVAVYLGHMYLFGDPLNRQTFRGSFVVSTWDSSDEMLSDSKNGTREMEMRQVLYNTYRTSELHSCFVSAEILALDSVSEGTRVHFEVSFEPIFTAVSTPDVAAVLERELETPFFLNIGVLAETLHIEESSIISSQALKEGDTTSTEAPTTEEITLEIEEELRECSSLSLPLCSHLPYNTTSYPNLVGHTSKEVLLRDLVAFRELLDAECSNLAQDFVCQMLQPRCEDDHVIRPCRAYCRAFHAGCGSRLPERLRPHFDCSRFPDYFGPGSCLPEPDCLGGLQRLALSRRACDAVPDCADAVDERSCSHCSSAGPGSLRCALQPRCLPQHLRCDGTPDCADGSDEAGCLWVTRSLSSWRRETRESTLGAIRHRAGYAVWAERGRVGKICAEPYAHDRKALMNVATSLCTALTFKSAVSAEAVPDAEEEPDEKNENTQTVSQKNEPEYVEIVDPAAAEISFVKSDCPQRKVIKIICDQLECGIPSARGAGATLGVQHLPRSARPGDWPWHAALLRTHVHACDALLVHAAWLVTTASCFQGQPKAEWTARLGIVRIQSTSPWQQERRIVGMVKSPVEGSMLAMVRLEEPVEITDFVRPACLPIDGFKTNEHSICNTLGWTRNRDQLQRVHVVPTTMNTCENVSIATGNGICAEPLYDQDDCDEEEYAGSAMMCFDEKSKHWSVIGVSSWRIACSKIGLGRPRIYDAVTSHIDWIRRTIANSSR-