Monarch geneset OGS2.0

DPOGS213342
TranscriptDPOGS213342-TA5358 bp
ProteinDPOGS213342-PA1785 aa
Genomic positionDPSCF300109 - 402344-422168
RNAseq coverage50x (Rank: top 70%)
Annotation
HeliconiusHMEL0145120.084.03% 
BombyxBGIBMGA009149-TA0.072.29% 
DrosophilaCG10186-PE0.059.97% 
EBI UniRef50UniRef50_D6WRB40.054.47%Putative uncharacterized protein n=2 Tax=Tribolium castaneum RepID=D6WRB4_TRICA
NCBI RefSeqXP_971920.20.055.69%PREDICTED: similar to AGAP007761-PA [Tribolium castaneum]
NCBI nr blastpgi|1892387380.055.69%PREDICTED: similar to AGAP007761-PA [Tribolium castaneum]
NCBI nr blastxgi|1892387380.056.14%PREDICTED: similar to AGAP007761-PA [Tribolium castaneum]
Group
KEGG pathwayoaa:1000845811e-30 
 K04012 (CR2, CD21)maps-> Complement and coagulation cascades
    B cell receptor signaling pathway
    Hematopoietic cell lineage
InterPro domain[267-323] IPR0160603.4e-18Complement control module
[1646-1701] IPR0004367.4e-14Sushi/SCR/CCP
Orthology groupMCL15911 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS213342-TA
ATGCATCCGCTCATGTTGGGTGCTGCGGGATGTACACCTGTACAGCCTTCCTCGATCACCGAGCTATCACTCCAGCTAAGAGTAGCGTGTTCATACAGACAAGTGTTGTGTATGTATAGTCGAAAGAAGGTAGGATTGCCAGCTGCGGCCACCGACGTCCCTCGTTCCTCGCTCTCGCCGGCGAGAACAGAGCAGTTTGAAACACATTGTGTGATGATAAGCTGTCACAGTCGGTCCAACCGAGAGTGTCCCGAGCTGGATCAACCTGAAATCGGGAAAGTTTCTCTGTCAGGGAGACTGTTCGGTGACAAGGCCGTGTACTCCTGTCCTCATGGGTACCACGTGGTCGGGCTCCAGAGCAGGAGTTGCCAGGCGGACGGCAAGTGGGCAGGACAGGCACCAGCCTGCAAGGAAAACATCTACTGTCTCCAACCCCCCACCATCGAGCACGCTCGTCACTCCGCTCTCCCCGAGCAAGCCACCTTCGACCTGGACGCTACTGTTCAGTACAACTGTCACACTGGTTACGTAACCAACGGCTTCCCTCGGGCCAAATGTCTGGCGATCGACGGCCAGGCCTCCTGGTACGGACCTGATATCACTTGCGAACCTCGTTCCTGCGGCGAGCCCGGGGACGTTCCTCATGGCTGGGTGACTGCCGATTGTCACACCTTCGGCTGCAGAGCCGTGGTGCAGTGCGGACAAGGGTTCGAGCTGGTGGGGAAGGCCGAGAGGTACTGTCAGGCCGACGGCGCCTGGGCTCCCAAGGAGCTGCCCACCTGTGTCCTGGTGACGCAGGTGCAGTGTCCTCCGCCCGAGGCTCCCCGGCACGGTAAAGCCGTTTACACCTCGTGCGCGTACAATTCAGTTGTGTCGTATGAGTGCAAGTATGGATACAGACTCGTGGGTGACGCTACTAGGAGGTGTGGAGCCGACAAGAAGTGGTCCGGGACCCAGCCCGTCTGTAAAGAAATTAACTGCGGCCACCCTGGACAGCTTTGGAATGGATGGCTGGAGAACATATCTTCGGGCACAGGATTGGGAGCTTCTATTATATTTAGATGTCAGGATGGAATGAAGATGGAGGGCAATGGCTCAGCAATATGTCAGAGTGATGGCACCTGGAGCCATCCGCTGCCGCAATGTTTAGCACCATGCGTGGTTCCCCACGTGTCCCAAGGTAAGGTGGTGCTGATGGAGAACCGAACCAACGACAACGAGACCAAGGAGCACACGCAGATAGTGGGCAGTTCTTCCATGGTGCAACATGGGGAAATGATTGTTGTGGACTGTGAAAAGAATTACGAGTTTCCGTCCAACAACGTGGCCGTGACCTGCAACAATGGTACCTGGACACAAATACCAAGATGTCAGCCGGCAAGGTGTAAAAAAATGCCCAAATATCCCCGACACGGAATGGTCATAGCTCCTAAAACTGAACATGGAATGAAAGCACGTTTCCGTTGCAAAGATGGTTATGAACTGAAAGGGAATCCTATAGTGGTGTGTTCGTTTGGAGTGTGGAGCGGAGAAACGCCTAAGTGTGAAGAGGTCTTCTGTCCTTTCCCGGGATATATTGAAAACGGAAAAGTTCTTCTCGTAGGTAACATGGGACTATACGATTATAGACCCTATGTTAAAAAGGTAGTAAACAATAAGCAAATAATGTATGAATGTGAGAAAGGCTACGTGCTATCGGAAGGGCCTCCCGGTGCTACCTGCGTTGGTGGTCATTGGAGCCCTAGAGAGCTGCCTAAATGCACCCTATATCAACATCCTCGCATCCGTTGGAGTCGCAGACGGCGCTCTATACCAGAGGTAGAGTTGCGTCACAAACGGTCCGCCTATCTTAGACAATACTATAATAAGCTGAGACAGTCTGATGCTTCACAAGAAGCAGTCGATGAATTGTATGACAAATATAACCCAGAAAACCAAAAGCCAACATTGCGACATGCAAATTTTAAGCTCATGAAAAAATTCGACCTTGAATACGAAGGCGAGGAGGATGGTATGATAGGCTCCGACATACCCGACAACGGGAACTTCCCTACTGCCATCTACACTGTATACAATGTTCATGGAGAACCGGTGGGGCACAAGTTCTACTCATATCAACCAGTCTATGAACCAGAAGATGATGAGTTAGTACATCGAGTTGACTCAATGGAGTACGAAGACTCGGCCGGTCTGGAAAATGATCATTTAACAATATTGAAAGGCGGAATGTTTCCGGATAACAACATAGACAACACGAACGAAGTGTCAGAAAATATTAACAAATTGAAACATCAATACTTCGAACGATACGTCGACAAGCGAAGGAAAAGATTCTTGACCTTGAAACCCACTGACAGCAGCGGCAAAGAAATAGACTTTGACTTAAACTCAACTGACAGCAATCAACATACGACTGTAGTAGTAAAGCGTAATGTGAAGACTAGAGTGAAGCGTAATGTCAATCTAGAAGAATCCGATACTGAAGTTTCTACAAAACATTCCATACTAATGGAGTCTCTGTTTAATGAGTCCGACCCAGAATCTTTAAAAGTAATACCGCTGCCACCAAGCACGACGGAATTAAGAAACGATAAAGTAATCAATTCACAGAAAGAGTTGGCTTTTTTTCTAGAACAGTATAATAATAAAAGTTCTGGAGGAGGAAATATTGGTGAGAAACGTGAAAAAAGAACAAGCAGAAAAACTGATCGATTGAATCAAACCACAGCTTCTGTTCCGAAAACAGGTCGCGAAGGAAAGGGTGGAAGGAATCGCCAAGCACAAGGTAACGCTGAAGAAACCGATAGCCAGAGTAATGAAAATAATGAAAGTGGTAATAAAAATGGAAAAAAAGGCAGACCAAAGAGTCCGTGCGAACCGATCGAGAGTGAGCCTTATGTCAATATAGAGATTGTTAAGTATGGACGAGAACCAAACAACACTTTTAGTTCCGGCACTATAGTAAGAGTTGCTTGCGGGAAGGGCTACGGTCTCAATCTGGAGCTCAACGCTACAGCTAAATGCGTTCGCGGTAGATGGAAACCAGAAAAACCTAAATGTGAAATTTTACCGTGCCATGTTCCATCAACCGAATACGGTATTTATACGATGTCACCGAGCGCCAGCCCGGCCAGTGTGGTGTCTGGACTTAGTCAAGGAGACGAGAAGGAACTGAACGAAACAGACCCAGTCCCGAACGGCCAGGTGGTGCACTTTTCATGTGAATACGGATACAACGTTCAAGGGCCAACCAATCTGCGCTGTTGGCTCGGAGAGTGGGCTGTAACGAGCATGCCAGAATGTGTCGCAGCCCCTTGTGAACTGCCGATCTTGAACGGTGCTACGTACGAGGCCGGTTACCGCTCAGGGCTCACGGTCGCCCACGCATCGTCCGTCAACATAGCATGTGAAGCTGGAAGGAGCCCTCCAGCTACACTCAACTGCCACCTCGGCAGACTTCAGCCTACTGTCATTGATTTTTGTAGGCCTTTAGCAAATCTGTCTCGTCCACGTCCTACGTCAGAATATCAAAGTGGTTCTGACATCGTCCGTGAAGACGTGTCTGAGCTGGAACCGGAGCTGCTGGGACGAGCGGCTGAATGCGGACCGCCCGCCAGGGTGCAAGGAACGCTTATTTACCGTGATGGATCTGAAGTCAACGAAACAGTCGAAATGAACGGTTACCCTCACGGGACCGAGGTCACCTTCAGGTGTATAGCGTCCATCATGGGGGAGAAGACCACTTGGAAACTCATTTGTGAAAATGGCAATTGGGTTGGCAAGAGTTTCAACTGTGAAGATTTAGAAGCTCAAAGTGAAGAATTGTTGAGCAACAACAGCTGCAACTTCCGGAACGAGGAGCCTCACGTAGTGTCCTTCTTCAACGATCTGGAAATAACTGAAACTGTAGATTTTCCGCCTGGATCGGTAATTGTGTCCAGATGCAGTGACATTGGCAAATACGCAATGACAGGATCTCAAGTACGCAGATGTGTGGGAGGCTCCTGGGACGGCGTTAAACCAGCCTGCTTTGGTCTCAACCAGGAAAACGATTACGCTATGGAGAAGCCGCCCACCATCCTGTTCAGACACTCCCAAGGTCCCATCGCTCAGACCAACGACGGCCGGCTGCTCGTCTATCCCGGAGCCGTGCTTCACATGGAGTGCCTCTGGATACGAAGATTCGGGAACCCCAAGTGGAACGTCACACTCAACGACCCCGAGCTAGAGCGAACCTACGCGGAAGGCTGGACTACGGACCCTGGTCGCGACAGTCAGCTGGAGTACAGACTCAGCATCATCAACGCTCAGAAGGAGGACTCAGGACTGTACCGCTGTGAGACGCCCGCCAGGCAGAGTCATCAAGTGGAAATTATTGTTGAAGATGTTCACTGCCCTCCTCTGCCACTGCGTCGCGGCCTGGTGGCCAGCTCGTCCGGGACCCAGCTGGGCACCGAGATCACCTTCCACTGCGCCAACGGAAACGCACTGCTGGGGGCTCAGACCCTCGTGTGCCGCGCCTCCGGCAACTGGAGCGCGCCGCTGCCAGTGTGCGAGAGTGTGGAATGTGGGGAGGTGGTCCAGGACAGTCCCCTGGGGGAAGGCGAACGCCGTCCTCGTGTGGCGGTGGTGTCGCGAGGAGTAGGAGGTCGCGCTGCCTTCTCATGTCCCCCAGGGTACGCGCTTTCCGGACCGACGGAGACCGTGTGTCTACCAGCTGCCGACTGGGCCAGGCCGTTTCCTATCTGTAAAGAGGTGTCGTGTCCGTCGCTTCCTCCCCCGGCGTCGGGATACGTGCTGGGGCGAGCGCCCTACCGAGCCGGCGACGTACTGCAGTTCCACTGTAACCCTGAACACACGCTCCACGGACGACCGATACTGGTCTGTCAGGACAGTGGACGTTGGAGTGATAAACCTCCTACTTGTGCTCAAGCCTGCACGTACCCCGGCACAACCATCTCCGGTCGCATGTCGTCAGTTAAATTCTACTACAAGATCGGGGAGACGGTGTCCTTCACCTGCGAGCCCGGCTACAGGCTGAAGGGTGCTCCTATGTTGAGATGTCTCAAAAACCGGAAATGGTCGAACGCAATCCCTCTGTGCACTCCCATCAGTAACTACACAGCCTCGGATTCGGTGTTGAACGCGGATATCGACGCTATGTTGTCGGACGAGGCTTACCTTCCGCGGGACATCCTGAAGCGACGGGAGTCTCCGGCAGTACTGTCCCCTGAGAGCTATAAGGCCGCCATGACCAGGGTCGCGGCTTCCAAGATGTTGTGGCGCTCGTCCATCGAGACCTACAAGAACAGAGTGTTCAGAGTCAACAGATTCATTAGACGAGACACTAACAGATAA

Protein sequence:

>DPOGS213342-PA
MHPLMLGAAGCTPVQPSSITELSLQLRVACSYRQVLCMYSRKKVGLPAAATDVPRSSLSPARTEQFETHCVMISCHSRSNRECPELDQPEIGKVSLSGRLFGDKAVYSCPHGYHVVGLQSRSCQADGKWAGQAPACKENIYCLQPPTIEHARHSALPEQATFDLDATVQYNCHTGYVTNGFPRAKCLAIDGQASWYGPDITCEPRSCGEPGDVPHGWVTADCHTFGCRAVVQCGQGFELVGKAERYCQADGAWAPKELPTCVLVTQVQCPPPEAPRHGKAVYTSCAYNSVVSYECKYGYRLVGDATRRCGADKKWSGTQPVCKEINCGHPGQLWNGWLENISSGTGLGASIIFRCQDGMKMEGNGSAICQSDGTWSHPLPQCLAPCVVPHVSQGKVVLMENRTNDNETKEHTQIVGSSSMVQHGEMIVVDCEKNYEFPSNNVAVTCNNGTWTQIPRCQPARCKKMPKYPRHGMVIAPKTEHGMKARFRCKDGYELKGNPIVVCSFGVWSGETPKCEEVFCPFPGYIENGKVLLVGNMGLYDYRPYVKKVVNNKQIMYECEKGYVLSEGPPGATCVGGHWSPRELPKCTLYQHPRIRWSRRRRSIPEVELRHKRSAYLRQYYNKLRQSDASQEAVDELYDKYNPENQKPTLRHANFKLMKKFDLEYEGEEDGMIGSDIPDNGNFPTAIYTVYNVHGEPVGHKFYSYQPVYEPEDDELVHRVDSMEYEDSAGLENDHLTILKGGMFPDNNIDNTNEVSENINKLKHQYFERYVDKRRKRFLTLKPTDSSGKEIDFDLNSTDSNQHTTVVVKRNVKTRVKRNVNLEESDTEVSTKHSILMESLFNESDPESLKVIPLPPSTTELRNDKVINSQKELAFFLEQYNNKSSGGGNIGEKREKRTSRKTDRLNQTTASVPKTGREGKGGRNRQAQGNAEETDSQSNENNESGNKNGKKGRPKSPCEPIESEPYVNIEIVKYGREPNNTFSSGTIVRVACGKGYGLNLELNATAKCVRGRWKPEKPKCEILPCHVPSTEYGIYTMSPSASPASVVSGLSQGDEKELNETDPVPNGQVVHFSCEYGYNVQGPTNLRCWLGEWAVTSMPECVAAPCELPILNGATYEAGYRSGLTVAHASSVNIACEAGRSPPATLNCHLGRLQPTVIDFCRPLANLSRPRPTSEYQSGSDIVREDVSELEPELLGRAAECGPPARVQGTLIYRDGSEVNETVEMNGYPHGTEVTFRCIASIMGEKTTWKLICENGNWVGKSFNCEDLEAQSEELLSNNSCNFRNEEPHVVSFFNDLEITETVDFPPGSVIVSRCSDIGKYAMTGSQVRRCVGGSWDGVKPACFGLNQENDYAMEKPPTILFRHSQGPIAQTNDGRLLVYPGAVLHMECLWIRRFGNPKWNVTLNDPELERTYAEGWTTDPGRDSQLEYRLSIINAQKEDSGLYRCETPARQSHQVEIIVEDVHCPPLPLRRGLVASSSGTQLGTEITFHCANGNALLGAQTLVCRASGNWSAPLPVCESVECGEVVQDSPLGEGERRPRVAVVSRGVGGRAAFSCPPGYALSGPTETVCLPAADWARPFPICKEVSCPSLPPPASGYVLGRAPYRAGDVLQFHCNPEHTLHGRPILVCQDSGRWSDKPPTCAQACTYPGTTISGRMSSVKFYYKIGETVSFTCEPGYRLKGAPMLRCLKNRKWSNAIPLCTPISNYTASDSVLNADIDAMLSDEAYLPRDILKRRESPAVLSPESYKAAMTRVAASKMLWRSSIETYKNRVFRVNRFIRRDTNR-