Monarch geneset OGS2.0

DPOGS215529
TranscriptDPOGS215529-TA5331 bp
ProteinDPOGS215529-PA1776 aa
Genomic positionDPSCF300467 + 80416-90530
RNAseq coverage152x (Rank: top 53%)
Annotation
HeliconiusHMEL0055350.054.16% 
BombyxBGIBMGA004226-TA1e-6935.94% 
Drosophila% 
EBI UniRef50UniRef50_B0WU782e-1025.15%Pacifastin light chain n=1 Tax=Culex quinquefasciatus RepID=B0WU78_CULQU
NCBI RefSeqXP_001858079.14e-1125.15%pacifastin light chain [Culex quinquefasciatus]
NCBI nr blastpgi|1700496947e-1025.15%pacifastin light chain [Culex quinquefasciatus]
NCBI nr blastxgi|3454879461e-2122.53%PREDICTED: hypothetical protein LOC100678556 [Nasonia vitripennis]
Group
Gene OntologyGO:00304144.4e-09peptidase inhibitor activity
KEGG pathway 
InterPro domain[180-217] IPR0080374.4e-09Proteinase inhibitor I19, pacifastin
Orthology groupMCL25001 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS215529-TA
ATGGCCTGCGTTCAAAATCCAGTTTCAAACCCGTCAAAAGTCACGAAAGAATATCTTAGAAGTGAGACCAAAGGATCAGACGTGAAGCAAAAAAAGAAAAGTAACAAAAAAGAGAAGGTTAGGAAAAGGGCAGTACTACCGAAGTTGCCCAAAGGCCGTTTAAGTGACCAAGATATAGCAGAAATAGAAGGAAGATCTGAATATGAATTACCGCAATTGCCGCATGTGGGAGTCGAGTGTGAACCCGGCCAATCCTATCTCGTGGAGTGCAATGTATGTTTCTGTTCAGAGAGACGAGATCTTTTCTGCACTATGAAATACTGCCTTAATGTGAAGAGCTATTCAGAGAACGCCCTTGCTGAATATGAAGGAAAACCTTGTGTCAGTGATTTTAAAAAGTATTGTATGAACTGCAAGTGCGTAAATAAGAAAAGTTCGTGCAATTTATTAAAGAACTGTAATTCTATTATGACTGGGCGACAACTGCTCACTGATGGGGATCACGTAGTGAAACTGGCTTTGGACGTTAAAAAAGACCATTGCACTCCTAATGTAACTTACAAAGTTCACTGCAACGATTGTTACTGTCAGCCCGAGGGTGATTTGAGATGCACTCAGAAAATGTGCCTCACTTACTCTCAAACGAAACAATTACAGGATCGGGAAGAGTATCTAGAGAAACATGGAACGTGCAAGCCAAGTTCTTATTTCACACTGAATGGTCGACTGTGCTATTGTGATTATCTAGGCAGATGGTCCGAAACAAATTGCAAAATTATAGCTCGAAGGCAATCATGTCAGCCGGGACAGACAATTTGGCAAGGTTGCCATCGATGCACATGCCAAAGAAATGGTCAGCTTTCTTGTTCAAATGATTGTCAGTTAGATGCTGCAAGACCACATGATTTCCTTGATTACGGCTCTATATGCACACCGTATAGATCATATTATGTTAATTGCAGTCTCTGTTTCTGTCCTGCTTCAGGGTTAACCTTTAGTGCACAGTGCGTGAGTGATTCTTCATGTTCCGTAGACCCTAACACTTCAGATATCCAGATATTGGCGAAAGCCAGTCAATGTATACCCAACGTCATGTATTTATTCCCTTGTATTCAGTGCCTATGTTCGGAAACAGGATATTTTATTTTAGATAAATGTTTAGAAAAATGTCAGTCGCAAGTCAAACCGCAACGTAGATGTATCCCCGGTTCCTTATACAGAAAAGATTGTTTAGTATGTCGTTGTCCGGACGATAGCATACCAGATGAAAAGCTGTGTGTGAAAGGATTGTGTCATAAAAATAAACATTTGTACTCATTGCGATCAACACCAAATCGTTGCGTTCCACACACTTTCACAAAACCTGTATGCCTTTATTGTGACTGCGGTTCTGAAGGCACTGTCAATGAGGACTCCTGTCTAGAGTTGGATTGCTCTAAAGCACTAGAGTTCAAGACATATACTGAAACAGAAACTTGTAATCCTGGAGAATTGGTCTCAATTTGTATAGAATGTTTTTGTCTCAATGATGGTCGGACGAAAAATATATATTGTACAAGAGTATGCACGTACCAAAGTAAATTGAATGTTCTAGAAAAACTTTTAAATCACAGTCTCACGGATGTAAGTTTGATTGACAAAAGTAAAATAACAAAAGCTAGAACCGGCGAAGATTGCAAGCGTAATACTCTTTATATAGACGAAGGCAGATACTGCTTGTGTTCCAATAATATGGACGGAAGCCTGAAATTCTGTACTTCATTCGTAGAAAATATTGCAAAGGTAGCCGAAACTACCAAATTATTAAGTAAGAACCACATAGATATAACGCAAGATTGCGAATCAAATACCCTAGTGGATTTCGATTGTAATACATGCTACTGCAATAAGAATGGAAAAATTGATCCTAAATGGTGTACTGACGATGATTGCGAAGCTAAAAGAATAGTCGAAGAATCTCATAAAGTTGTTCCGGGTTCTATATCAAAAATAAAATGTAATTTCTGCATTTGTCCTGACAGCGGGGAGCTTAAAGAACGAGTATGTACGAAAAACATATGCGAGGATAACAAGGCGATGATGTACGAAAAGTTCACTTGCGAACCCCTCGCATATTATGAAGTCGACTGCAATATCTGCTATTGTCCCCAAGATGGCTTGAAAAATGTAGCTATGTGCAGCAAAAATCAATGCGAGAAATCTTTCCTTAGATCGACTGAGTGCATCCCGGGTAACTTGTTCAGTGATGAATGCAACGTTTGTGTGTGTCCGCCAAATGGAAATAAAATAGATAAAGTATGCATGAATCACACGTGCAGCTTACCACCATGGAATACCATCATATTGTCCCATACATTGGTGGAGCAGCAGATAAATAAGGATCCGACGAGGAGTTTGGACCTGTGCTATCCGGGTGAAGAATTCGTGATGGGCTGCAAATTATGTGTATGTCCAGACTTAGGACTAAAAGTGTATGCAACCTGTGAATCAGTGCTGTGCAAAGACAACTTTATGTCATTTAGAAACATATCTAAAGACGACGGTGGTAGAACATCCGATAATTCCGAAAACGAGTCAAATTTTTTATACCACTCGAGACAAAAGAGGGAAGATATCAATACGTGCTTTCATTTAAACATCTCTCATAGCGCGGAGAGAAAGGACTGCACTCCTGGTACAACGTATATTATTAGATGCAAGCAGTGTATATGTCCTTATATCGGAAACATCAACAATTTTTGCCGTCCATTGCCAAAATCAACGTTTTGTGAAGAAGCATATCCCGGCTTTAACTATTTGCCTATGGGCCGAAGAAATCGTGCAAACACGAGTACTACAGACTCTAACAGCAAGGTATACACAGTGACCGTTAAACATTTAAATCACACCGTACACAAATGCGACGAGCCCGGGACCTTCAGAGATGAATGTCACATATGCCAATGCGAAAATAATATTATTATAGAGGAACACTGTTTTAAAAGTGACGCCAAAAACTGCAGCGATGGACAAAACTATGAATGTGATCCCAACAGAATCTACAAAGGAAAGAATATCACATGTGCCTGTTCTAGTAACGGTCTGTGGCTGGAAAAGGATTGCGAAGGATATTTAGATCCTAGAGGAGAACCCGATTGCGAACCAAACACGTACACATTTCTTGATTGCAATGTGTGCCTTTGCGGACCAGACGGCAGAATTAATAAGAATCGCTGCACCAAACACGACTGTGAACCAATTATCACCCGCAGATCCAAATCTATTATAGGTACCTGTCGAGTCAATACATTCTATTCATTAGCACCATGCCAATTCTGTTATTGCGTAAACAAACACAAACTAGTTTGCAACGCCGCTCCGAAGACTGAGAAAGTGGTCTTAGGAAAGTTTGAATTAAACCAATGTGGTGGCAATATATTAAAGGAAATATCTGATTTAATGCCAGAGAAACCTCTCAGGTCTGGAAAGACGTCCAAGAAGGATAAATCGAAAACAGCCAAGAGCAACAAAGCGACAACAGTTCTGGATATAGGAACAAATATCAGGTCTATTGACGATAAAACAGATAAAAGCGATGAAGTACAAACAGGTGCGAAACATCAAAACAAAAAGAAACATCGAAATAAAGATGGCGTCAAAAAGGATAAATACGTAAAGAAAAAAACTAGTGAAGCTATTGTTGATGATGGTTGGAGTGATTATTCTGAAGAGAAACAGAAGAAGAAAAATAATTCAAAAAAATCTAAATCATTCCACACAAAGACCGAATCTAATCCAGAGACAGGCGACAAAGAGGAAAACAATCTTCTCACAATTAATTTACCATACGTATTAGATAGAGTACTTAATATGGTTCTGCGTAAATCCATGGTGTCTTTGGAATCAGCATTGCCTCAAAAACTATGGAGTGAACGTTGTACCAAGAACTCTATCGCTTTGAATGATTGCAACTGGTGTTGGTGTGATGTCAGGCAGAGGTTTCAATGTAAAGCCCGAATCTGTGAGGAGGTTGATATGTTTGGACATTTTAAAGACGCTATCCGTGACATCGATGTGGGGATGGAAGGACACGGTTCTTGGCGATCTTCAATGACCCCGTGCACACCCGGTGTTCATTATAGAAGGGGGGATGTTTTATGCTTGTGTGATGAAGATGGAAATTGGCCGAATCCAGTCTGCAGAGACATCTTCAGAGTTTTGCATGCTGTGGAAGTCCATCGGGATACTGTTAATAAAACCTGTACTCCTTCTAAACTATATCTCATTGGCTGTAACGTCTGCTTCTGTTCATCATCAGGAATTTTAGATCCAGAATTCTGTACAAAACAAGAATGTCAGGAAGATGACCCAGCGCTACCGGAAAACCGACAGATAACCCAGTCTGATTATGAAAAAATTTCTGAAGTGTACGCCAGATGTGATTCTGACGAAGCTTATGAGCTTGGATGTAGAAAATGCATTTGTCTGAAAAATAATAGACTTATATGTAATGACTGTTCAAAAGAACAGACAACCGTGACAGAAAAGCCTTTATACAGAAAGAGAAGACGCAAAAGGAGGAGAAAGAGAATAAGAAATACCAGATTTGGATTATGCGAAGGAAAATATCCCAAAGAGAAATTCTCACTCGGTTGCAGTACCTGCTTCTGCGACAAGTACTCAGGAATATACTGCTCAGTGAGGAAGTGTCTTAAGCCTATAAGAACACTGAAAGCGAGTCTGTCCCTCTCTCAAAATGATCCATCTATAGTTCCTCCAAAAGTAAAACAAGTGGAACCACCACTGGACGATGACATGTGTCAGTCAAACACCAAATATACCAAACGATGCAATGAATGCATCTGTCTCAAACTAGAAAACAATGTCAAGGTTTTGGATTGTTCACTGAAAAAATGTTCGTCTTCACAAGTTGATGAAATGTTCGAGAATGACTGCGTCGTTGGCAGCGTGTACATGAGGGATTGCCGCATTTGTTACTGTTATACCATTGACGGAGTCAGACACCAAGTATGTCACGCCAACGCAGAGTGCGGGGAGAATAAAAATGAGTTGGACATGGGATTTTGTATACCTATGCGCATGTACAAGAAAGACTGCAACACCTGCAAATGCCTGTCCGATGGAAAAACTTTAAAGTGCAGTAGTGCACCTTGCACGGTAAGGTCGTCGAACCCAGTCTCAGTGGATCTGGTGCCGGTAACAATGATGAACGGCGACCCCTGTCCGAAAGGTTTCTCCTATAAATTGGACTGCAATGTATGCTTTTGTCTGTCTAATGGGAACGCTATATGCACAACGAGAGACTGTTCTAACGATGACCAAGAATTTTGA

Protein sequence:

>DPOGS215529-PA
MACVQNPVSNPSKVTKEYLRSETKGSDVKQKKKSNKKEKVRKRAVLPKLPKGRLSDQDIAEIEGRSEYELPQLPHVGVECEPGQSYLVECNVCFCSERRDLFCTMKYCLNVKSYSENALAEYEGKPCVSDFKKYCMNCKCVNKKSSCNLLKNCNSIMTGRQLLTDGDHVVKLALDVKKDHCTPNVTYKVHCNDCYCQPEGDLRCTQKMCLTYSQTKQLQDREEYLEKHGTCKPSSYFTLNGRLCYCDYLGRWSETNCKIIARRQSCQPGQTIWQGCHRCTCQRNGQLSCSNDCQLDAARPHDFLDYGSICTPYRSYYVNCSLCFCPASGLTFSAQCVSDSSCSVDPNTSDIQILAKASQCIPNVMYLFPCIQCLCSETGYFILDKCLEKCQSQVKPQRRCIPGSLYRKDCLVCRCPDDSIPDEKLCVKGLCHKNKHLYSLRSTPNRCVPHTFTKPVCLYCDCGSEGTVNEDSCLELDCSKALEFKTYTETETCNPGELVSICIECFCLNDGRTKNIYCTRVCTYQSKLNVLEKLLNHSLTDVSLIDKSKITKARTGEDCKRNTLYIDEGRYCLCSNNMDGSLKFCTSFVENIAKVAETTKLLSKNHIDITQDCESNTLVDFDCNTCYCNKNGKIDPKWCTDDDCEAKRIVEESHKVVPGSISKIKCNFCICPDSGELKERVCTKNICEDNKAMMYEKFTCEPLAYYEVDCNICYCPQDGLKNVAMCSKNQCEKSFLRSTECIPGNLFSDECNVCVCPPNGNKIDKVCMNHTCSLPPWNTIILSHTLVEQQINKDPTRSLDLCYPGEEFVMGCKLCVCPDLGLKVYATCESVLCKDNFMSFRNISKDDGGRTSDNSENESNFLYHSRQKREDINTCFHLNISHSAERKDCTPGTTYIIRCKQCICPYIGNINNFCRPLPKSTFCEEAYPGFNYLPMGRRNRANTSTTDSNSKVYTVTVKHLNHTVHKCDEPGTFRDECHICQCENNIIIEEHCFKSDAKNCSDGQNYECDPNRIYKGKNITCACSSNGLWLEKDCEGYLDPRGEPDCEPNTYTFLDCNVCLCGPDGRINKNRCTKHDCEPIITRRSKSIIGTCRVNTFYSLAPCQFCYCVNKHKLVCNAAPKTEKVVLGKFELNQCGGNILKEISDLMPEKPLRSGKTSKKDKSKTAKSNKATTVLDIGTNIRSIDDKTDKSDEVQTGAKHQNKKKHRNKDGVKKDKYVKKKTSEAIVDDGWSDYSEEKQKKKNNSKKSKSFHTKTESNPETGDKEENNLLTINLPYVLDRVLNMVLRKSMVSLESALPQKLWSERCTKNSIALNDCNWCWCDVRQRFQCKARICEEVDMFGHFKDAIRDIDVGMEGHGSWRSSMTPCTPGVHYRRGDVLCLCDEDGNWPNPVCRDIFRVLHAVEVHRDTVNKTCTPSKLYLIGCNVCFCSSSGILDPEFCTKQECQEDDPALPENRQITQSDYEKISEVYARCDSDEAYELGCRKCICLKNNRLICNDCSKEQTTVTEKPLYRKRRRKRRRKRIRNTRFGLCEGKYPKEKFSLGCSTCFCDKYSGIYCSVRKCLKPIRTLKASLSLSQNDPSIVPPKVKQVEPPLDDDMCQSNTKYTKRCNECICLKLENNVKVLDCSLKKCSSSQVDEMFENDCVVGSVYMRDCRICYCYTIDGVRHQVCHANAECGENKNELDMGFCIPMRMYKKDCNTCKCLSDGKTLKCSSAPCTVRSSNPVSVDLVPVTMMNGDPCPKGFSYKLDCNVCFCLSNGNAICTTRDCSNDDQEF-