Monarch geneset OGS2.0

DPOGS213249
TranscriptDPOGS213249-TA4089 bp
ProteinDPOGS213249-PA1362 aa
Genomic positionDPSCF300124 + 247017-258240
RNAseq coverage160x (Rank: top 52%)
Annotation
HeliconiusHMEL0165393e-17739.16% 
BombyxBGIBMGA009442-TA6e-6527.80% 
Drosophilarobo-PB2e-3125.42% 
EBI UniRef50UniRef50_E1B9K43e-9826.92%Uncharacterized protein n=9 Tax=Theria RepID=E1B9K4_BOVIN
NCBI RefSeqXP_002423716.12e-9426.30%hemicentin, putative [Pediculus humanus corporis]
NCBI nr blastpgi|3584146435e-9827.01%PREDICTED: LOW QUALITY PROTEIN: hemicentin-2 [Bos taurus]
NCBI nr blastxgi|2420057327e-9926.30%hemicentin, putative [Pediculus humanus corporis]
Group
Gene OntologyGO:00468724.2e-12metal ion binding
GO:00167874.2e-12hydrolase activity
GO:00036764.2e-12nucleic acid binding
KEGG pathway 
InterPro domain[666-758] IPR0137831e-18Immunoglobulin-like fold
[598-683] IPR0130986.1e-14Immunoglobulin I-set
[1193-1333] IPR0016044.2e-12DNA/RNA non-specific endonuclease
[699-761] IPR0035985.3e-12Immunoglobulin subtype 2
[1192-1335] IPR0208218.6e-12Extracellular Endonuclease, subunit A
[782-864] IPR0035997.4e-08Immunoglobulin subtype
Orthology groupMCL29789 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS213249-TA
ATGTTGTTATTATTTATATCAATGGTCTTATTCGGTTTTACAAACGGGCATAACGCGAAAAGCAGCCTCACATTCGTTATAGATGATACGGGATCTATGTGGAATGATATTGATCAGGTAAAAGAGAAGACAAACGAAGTGTTCGATGCAGTGCTCAATTCTAACGCATCAAAAATAGACGATTTCGTGCTTGTCACCTTTAATGATCCAGATGCAAAAGTCTGCACAGTGACCCGAGATCGCAAAGAATTTAAAAAGGCTCTTTCCGATATTACTGTAGACGGTGGTGGAGATTGTCCTGAGTACTCAATGAAAGGAATCCAACTTGCGTTAGAGCACAGTAAACCGAATTCGTTGTTTTATGTTTTCACGGACGCAGCGTCAAAGGATTACGAAGAGTACGAAAAAATTAAGAGCTTAGGGCTCAAGAAGTCTATTCAGGTCACATTTTTACTGACCGGCGAATGTACGAACACTCCAGAAGAGGCATTTACTGTTTATGATAAATTAGCGGAAACTACTTCAGGACAGGTTTTCCACTTAGACAAACAGGATGTGAGCAAAATAATAGACTATATCATAGCAACGATAAAAAACAAAAAGACGACTGTGGCCCAAAAAACTTTTTATAACGGTTACGGCAATGAGTTCAAATTTTCTATAGACAGCAAATTATGGGATGTAATGATTTCTGTATCAGCTGACGATCCGAGATTTCACTTGAATGGTCCTGATGGGGAATCAGTAGATGTTAAAGAATTCATCAGTACAAAGAAATCCAGTATATCTAAATTGGATGTTAAACCGGGGATATACACTATGGTCCTGGATAATATTGGTCAAACCTCAGTGGTTATAACTGGATCCACATATGTTTGTTTTCAACATGGCTTCTCTACTGTCATGCCTAGCACATTGAACGAAACGTCTACTAAACCTATCGAAGATACACCATCTTATTTAGCTATTGAATTGGATAATGTTAACAGAGATGTTATATTAGACACAGTAGAAATTAGAGATATTAATGATAATATCTTAAGTGCTTATCCATTGGACCTGCTAAATAAGGACAGTCAATTCTACGTAACAAAACCAATTCTGACACCTGATTCTACCTTCAAAATCGCAATAAATGGTCACACAAGCACTGAAGAGAAAATAACTAGAATTGCACCAACGGCTATTGAGCATCAAAAACCTGATTTAGAAGGACCAAAGCGAAAAGCACCGATGGTCACAATTTTAGAAGGAAGTATCACAACTGTTGAATACGATTCAAATTTATCCCTAAAATGCAAAGTGCACGCTTTTCCTAAACCGGATATTGTGTGGAAAGATGACTCAGGAATGATCTGGCCATCAAAAGTAGTCCCTGTGGATTTGCCTTATGATTATATGGGTATTCTGGATAAAGATAAAATAAATAAAAATATAACTTTATATTGTACAGCGAAAAATGAAATCGGTGAAGATAAAAAATCTATTTTAGTTGAAACTAAAAGAAATTATTTCCTGGAAATTCTCGAAAGTCCCAAAGATTTGGTTATTGAGTATGGAAGTTCTAGTGTTCTTAATTTCAAAGTGAATGCATATCCAGCAGCAACAATAGGGTGGTATAAAAAGAGGAAAGAACTTTTTAATGACGATGATTACGAAATATCTGCAGATGGTTCCACACTAAAAATAAAATACATGCATCAAAGTTTAAGAGGATTTTATTCAGTGAAGGCTATGAATGAAGAAGAAAAAAAGATAATTTACTTCAAAATAGATATGTCTGGAGAAAAGCCTGAAATAGACAAGACTGTCAGTTCTTACAGAATAGAGAAAGGATCTAGTGCCAATTTAACGTGCAGAATCCTAAAAGGTAAACCAGAGCCTGAAATTTCTTGGACTTTTCAAAATGAATCTCCAGGTTCTTTAAAGCGTTTAGATGTTGTTGGCGATCTTTATATTGATAAGGTTGGTCCAGAGAATATGGGAATATACACCTGCAAAGCTCGAAATGAATTTGGAAAGGATCGTCACGATATTGACTTGTTCGTTGGATATGTCCCAACAATCAAGAATGTTCAAACTGAGGTTTTAGTGCCGGAGGGTCAACAAGTCATATTGACTTGTATAGTCGACGGCTCTCCTTACCCTTTCGTTCGTTGGCTACTAAATGATGTTGAAGTAACAAGAACAGGAAAATATTCATTCAATGACAATAGACTAAGTTTTACAGGTTCAATTGACGATAGCGGCATATACACCTGTGAAGCTTCAAACAGTTTAGGACGAACGCAAAAAGACTACGATGTCGATATTTATATTCCAGTAAAAATGCAGGTGCCAAAAGACACAACTTTAAAATTAGATGTTGGAAGTTCTACGACATTGCCATGCGTCGCTGAAGGTTATCCAAAACCAAACATCAGATGGACTTATTACAGCAAAAATCCTAGCATTCGTCCTAAGACATTGAAATTTGATGATACCGGCTCATATAACTTAGAACATATTCAAATAGAAGATGAAGGTTTCTATAGGTGCTCAGCAAGTAACGTCGGAGGATTAAGTAGTGTTACCTACGAAGTTTTTGTCAGAGCCCCTGTATCTATTACGAATCCAGATGGAGTTGTTTTTAACGCTGTAAAGGGAGACCTGGCGCTTAGGATCCCCTGTAATGCCATTGGCAGCCCGAAACCTAAAGTAACATGGATGGCAAATGGTGAACACATTGCTTCAGGGACTGATTGGTACGACATAGAAGATGATGGCACTTTGAACGGAGAACTAATAGCAAGTGATGAATTATATTTAAGCCACGTTAAATTTGAAGATGCTGGTATTTACAGTTGCCGTGTTAGTACGTTTTTATCAGCTCACACTGCACATAAAAAGGTGACTGTAGGTTATAAGCCAAGATTTCTCAGTGACGAAGAAACTGTTATAGAATATTCGGAAGGCGATTTCTCTTATATGGACTGCAATGCCGATGGCTATCCAAAGCCAAGCACGCAATGGATACGTAATGGTGATCCTGTACCTATAAATGGGTCGTATCTAATTATAGAAATGAAACTTGAAGATATCGGATACTACCAATGTACCGTAAGCAATGATCTTGGTTCAATTAGACGTACTTTTAAAATTAATTCAGGAGAATGCCTGCTGCGTACTAAGCATGATTTTAATGATCAGCAGCCTTTACTTTTGACTCTATCCAGAGACTGGCCAGAATTTAGAACATCAAATGAATATGTCCATATACCAATTTATAAATATTTTCTTCTATCATGCCCCGGCAGTTCTGTATACGACGTTTGTTTAGATCACGAACAGAAGATACCTCTCTTTGCAAAACAAACCTCAAATAAAGGTATCGCCCTGAATGCACCCCCCGGAGATTACACATTTGTTGAAAGTAAATATTTGCCCTTTCATTTTGGGGACATGTATGACTGTGATTCTCAGTTGAGATTTATTTCATCGTCGATCGGAAAATCAATAAAACCAGTTAAAGATGTTGAATGCTGTTTTACAAAAAGACAATTGATCAATCCTCGAGATGTTTTGCCGGGATTATCACAAGTGGCTGTATATAGCTATTTAAATGTTATACCTCATTGGAGTACCTGTGGAACTAAAAACTGGGATGAACTTGAACTAAGAGTACGATATCTGGGAAAATATTCATCTAATGAGCTGACCATTTTCACTGGAGCATCAGATCCGATGATGTTGCCAGGACAGACAGAAGATGCTTATGTGTCCTTAAGAGACAGATTAAACAGACGTCAACCAGTGCCCATGTATTTATGGAAGATAATTCAAAACCCGGCAGATAATTCTTCCTTAGCTGTCATCCAACTAAATATTCCTAATGTTACGTCAGCGGAGGCCTATTCTTATATGCCATGTAACGATATATGTCCCGAAGTCGAGTGGTTGCGTAATAACGATTGGCAGGATGTGAATAAGGGATTCACATTCTGTTGCAGTATTAGTGATTTTAATTCACGTTTCGGCAAGCTTTTTGACGGATGTGAAAAAGTATTCAAGACTTTACCACCTTTATTACCTGATTTTTCTCTTATCTAA

Protein sequence:

>DPOGS213249-PA
MLLLFISMVLFGFTNGHNAKSSLTFVIDDTGSMWNDIDQVKEKTNEVFDAVLNSNASKIDDFVLVTFNDPDAKVCTVTRDRKEFKKALSDITVDGGGDCPEYSMKGIQLALEHSKPNSLFYVFTDAASKDYEEYEKIKSLGLKKSIQVTFLLTGECTNTPEEAFTVYDKLAETTSGQVFHLDKQDVSKIIDYIIATIKNKKTTVAQKTFYNGYGNEFKFSIDSKLWDVMISVSADDPRFHLNGPDGESVDVKEFISTKKSSISKLDVKPGIYTMVLDNIGQTSVVITGSTYVCFQHGFSTVMPSTLNETSTKPIEDTPSYLAIELDNVNRDVILDTVEIRDINDNILSAYPLDLLNKDSQFYVTKPILTPDSTFKIAINGHTSTEEKITRIAPTAIEHQKPDLEGPKRKAPMVTILEGSITTVEYDSNLSLKCKVHAFPKPDIVWKDDSGMIWPSKVVPVDLPYDYMGILDKDKINKNITLYCTAKNEIGEDKKSILVETKRNYFLEILESPKDLVIEYGSSSVLNFKVNAYPAATIGWYKKRKELFNDDDYEISADGSTLKIKYMHQSLRGFYSVKAMNEEEKKIIYFKIDMSGEKPEIDKTVSSYRIEKGSSANLTCRILKGKPEPEISWTFQNESPGSLKRLDVVGDLYIDKVGPENMGIYTCKARNEFGKDRHDIDLFVGYVPTIKNVQTEVLVPEGQQVILTCIVDGSPYPFVRWLLNDVEVTRTGKYSFNDNRLSFTGSIDDSGIYTCEASNSLGRTQKDYDVDIYIPVKMQVPKDTTLKLDVGSSTTLPCVAEGYPKPNIRWTYYSKNPSIRPKTLKFDDTGSYNLEHIQIEDEGFYRCSASNVGGLSSVTYEVFVRAPVSITNPDGVVFNAVKGDLALRIPCNAIGSPKPKVTWMANGEHIASGTDWYDIEDDGTLNGELIASDELYLSHVKFEDAGIYSCRVSTFLSAHTAHKKVTVGYKPRFLSDEETVIEYSEGDFSYMDCNADGYPKPSTQWIRNGDPVPINGSYLIIEMKLEDIGYYQCTVSNDLGSIRRTFKINSGECLLRTKHDFNDQQPLLLTLSRDWPEFRTSNEYVHIPIYKYFLLSCPGSSVYDVCLDHEQKIPLFAKQTSNKGIALNAPPGDYTFVESKYLPFHFGDMYDCDSQLRFISSSIGKSIKPVKDVECCFTKRQLINPRDVLPGLSQVAVYSYLNVIPHWSTCGTKNWDELELRVRYLGKYSSNELTIFTGASDPMMLPGQTEDAYVSLRDRLNRRQPVPMYLWKIIQNPADNSSLAVIQLNIPNVTSAEAYSYMPCNDICPEVEWLRNNDWQDVNKGFTFCCSISDFNSRFGKLFDGCEKVFKTLPPLLPDFSLI-