Monarch geneset OGS2.0

DPOGS207544
TranscriptDPOGS207544-TA2754 bp
ProteinDPOGS207544-PA917 aa
Genomic positionDPSCF300598 - 443-15842
RNAseq coverage12x (Rank: top 83%)
Annotation
HeliconiusHMEL0045360.043.28% 
BombyxBGIBMGA009343-TA3e-10136.77% 
DrosophilaCG12006-PA2e-2359.04% 
EBI UniRef50UniRef50_E0VC721e-7526.17%Hemicentin, putative n=2 Tax=Eumetazoa RepID=E0VC72_PEDHC
NCBI RefSeqXP_002423716.12e-7626.17%hemicentin, putative [Pediculus humanus corporis]
NCBI nr blastpgi|3637452082e-7626.67%PREDICTED: LOW QUALITY PROTEIN: hemicentin-1 [Gallus gallus]
NCBI nr blastxgi|3016084701e-8328.59%PREDICTED: hemicentin-1 [Xenopus (Silurana) tropicalis]
Group
Gene OntologyGO:00065062.8e-23GPI anchor biosynthetic process
GO:00167572.8e-23transferase activity, transferring glycosyl groups
GO:00312272.8e-23intrinsic to endoplasmic reticulum membrane
KEGG pathwaydre:3177315e-23 
 K12567 (TTN)maps-> Dilated cardiomyopathy
    Hypertrophic cardiomyopathy (HCM)
InterPro domain[15-88] IPR0055992.8e-23GPI mannosyltransferase
[772-899] IPR0137836.6e-21Immunoglobulin-like fold
[823-899] IPR0130984.3e-14Immunoglobulin I-set
[825-889] IPR0035982.7e-13Immunoglobulin subtype 2
[819-900] IPR0035997.6e-09Immunoglobulin subtype
Orthology groupMCL15420 Patchy
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS207544-TA
ATGGCTTTGGCAAAGGGCCTGAGGCCCGTGCAGGTTGGGGCAGTAATATTGGCGGTGCGTATTTTATCCGTGTTCCTAGTGCAAACCTGGTATGTGCCGGATGAGTATTGGCAAACCTTAGAAGTGGCACACAAATATGCCTTCGGTTATGGAGCCCTGACTTGGGAGTGGCAAAAGGGGATACGAAGCTATCTATACCCCAGTGTAGTCGCTGTGCTTTACTCCGTGTTGAAATTCACTGGCCTTGATTATCCAAATGTTGTGGTTTTAGAATTCGTTCGGAGTTCTATCCGGGGACGTACAGTTAACCTTGGTTCATCAGTGAACCCCCCAGGATACAATTACACTCAACAGATTCCAGTCGATAAGACAGTGGGTGAAGTAACAGTGTCTGTGTCGGGAGCAAAGCCCAGTATTAAAGTAGTGAAGCCAAGCGGCGAGGAACTCACAGGCCCTCCACAACTCGTCACTATATTGGATCTGTCTGAGATAATGATAGTGAAGGTGTTATCTCCAGAGCCTGGTCCGTGGCGTGTGACGGTGGGCAGCTCCGAGCCTCACTCGGTGAGGGTCAAAGGTCTCTCTGAGCTCTCCTTCACTCATGGATTCTCTGTCACAGACGTAACAAGCCTCAATCACACCAGCTACAGACCCTTAAAAGGTACATATAACAACATGTTGATATCCCTTCCCGCCAATACTTCCATCAAGCTGGACCACGCGGAACTCCTGGATTTGAAAGGGAAACCGTTGTTTGAGATACCTCTGAAGAAAATAGACGCGACCAGTAACGTATACAAAGCGGACGCCTACATACCACCCGAGGAGTTTTTTAATATTGCAGTAGTAGGTGTGGACTCCTCGGGCAACGAAGTCCGCCGGAGCAGCCCCACCGCCGTGAGCGCGGCGCCGCCCGATATTCCCAGTGTGACGGTACCGAAGAAGATAGTGTCTCACCCACACTCTCGTGTCGTACTACCATGTTCCGTGGACAGCGTCGTGCCGGTCACCGTTGTTTGGACTCGACGCGGTATCGACCTTCACGAACACACTCACAGTCTACGAAGCACTACAGCTGAGTATGTAATAAAGGACGTGTCTGAAGCCGATGTGGGTACGTATCGCTGTGTCGCGAGTAACGCCGCGGGGCGCGCAACTGCGGAGACGAGCGTCGATATGATAGCGTTGCCCCCACAAGTGACGGTGACTCCCACGAACGTCACCGTGACAGAGACCGAGCGTGTGTTGTTGACGTGTTCTATATACAGCGAGACGTATCTGCACCGAGCCAGGATCGAGTTCCAAGGGGATCTGAAACATTACGATATCAAATTGGAGCCATCGATCGATGGTTTGTACACATTAAACAGAACCATTGAAGAGGCTAAGCAAAATGACAGCGGTATTTACACCTGCATAGCCGCAAACAGAGGGGGTGTTTCAAACCAGTCCACCGAACTCACAGTGGTACCGAAACCCACAGCTCTCATCCTCGGTCCACACACGCTAACAAAGCTCCTACACTCAGACATCCAACTCGTCTGTCACGTGGAAAACGCTGTGACAGTAATTTGGACGTACAACGACATAACGGTGGCGAGTAACGAAGTTAACGGGAACTATAACGACGTGATGAATGTGGATAACGTGAAGGAGGACGGCGTCTGGACCTGCTCGGCCAACAAAGGCACATATAGCGCATCAGATTCCGTCCAAGTGACCGTACATATCAAACCCGAAGTTAGCATAGTCGGTTCTAAGAACGTGTCTCTGCCACAAAACAGCACTTACGATGTCGTATGTACAGTGGTCGCTCGACCACAGCCACGCGTGTTGTGGCACAAAGAGACCGAGGAGTTTTTATATCACGTTCTAACAAATCCTGAACCAAATTTATACAGAAGCGTGTTGACTGTAAACGGTACGAATGGTACTTATTTCTGTATCGGCGAGAATTCTGAAGGCATCCATCAAGATAGTGTGGATATTAATGTTTACTCTCCAATGATTTTGGAACGACAACTTAACGATACAACCGTAGAATTATACTCACCGGTCAAATTCCATTGCCAAATAAGCGCATATCCCAAACCTAATATAAAATGGCGTCATAACGACACGCACGTGACACGGGATGATAATGTCGCCGTTGAAAATGATGTCCTTATTATAAAGAGGGTGGATTTTGATAACCTCGGAGTGTATTTATGTGAAGCTGATAATGGTTATGAGAAAATTACAGTGAATTTCAGCTTAGGTTTACACGGACTGGCAAAACCGCTTATATATAAGGAACATGAGAAGATTGTTGTTCGTATAGAAAATGTTAATAAAAATAACACGGGCGCGTACAGATGTGAAGCGAGTAACGTCATAGGAGAAGACGTTCATGAACTGACAGTGAGTGTGCAATATCCGCCAGAACTACACTCAGACCAGGAAGCCTACAAGATGGAAGGTCCACGGCAAGTCAGGATGGGCGACGCGATAAGCCTGAACTGTAATGTGACCGGCGACCCGCTTCCGCTCGTGACTTGGACCAAAAACGGCTTACCGATAAATTATTCAAAAATACGTCAACATCTGCACGGAGACACTTTGGTGATCGAGAGCGCCACCAAGTTCGATTCCGGCGTCTTTGTGTGTAACGCCAGTAATGTCCTGGGATCTACGTCGCAGAATTTCACTATAATTGTATATGGTGAGATCATTTGTGCTACAAATAATGAGCTAAAAATTCTATTTAATATATAG

Protein sequence:

>DPOGS207544-PA
MALAKGLRPVQVGAVILAVRILSVFLVQTWYVPDEYWQTLEVAHKYAFGYGALTWEWQKGIRSYLYPSVVAVLYSVLKFTGLDYPNVVVLEFVRSSIRGRTVNLGSSVNPPGYNYTQQIPVDKTVGEVTVSVSGAKPSIKVVKPSGEELTGPPQLVTILDLSEIMIVKVLSPEPGPWRVTVGSSEPHSVRVKGLSELSFTHGFSVTDVTSLNHTSYRPLKGTYNNMLISLPANTSIKLDHAELLDLKGKPLFEIPLKKIDATSNVYKADAYIPPEEFFNIAVVGVDSSGNEVRRSSPTAVSAAPPDIPSVTVPKKIVSHPHSRVVLPCSVDSVVPVTVVWTRRGIDLHEHTHSLRSTTAEYVIKDVSEADVGTYRCVASNAAGRATAETSVDMIALPPQVTVTPTNVTVTETERVLLTCSIYSETYLHRARIEFQGDLKHYDIKLEPSIDGLYTLNRTIEEAKQNDSGIYTCIAANRGGVSNQSTELTVVPKPTALILGPHTLTKLLHSDIQLVCHVENAVTVIWTYNDITVASNEVNGNYNDVMNVDNVKEDGVWTCSANKGTYSASDSVQVTVHIKPEVSIVGSKNVSLPQNSTYDVVCTVVARPQPRVLWHKETEEFLYHVLTNPEPNLYRSVLTVNGTNGTYFCIGENSEGIHQDSVDINVYSPMILERQLNDTTVELYSPVKFHCQISAYPKPNIKWRHNDTHVTRDDNVAVENDVLIIKRVDFDNLGVYLCEADNGYEKITVNFSLGLHGLAKPLIYKEHEKIVVRIENVNKNNTGAYRCEASNVIGEDVHELTVSVQYPPELHSDQEAYKMEGPRQVRMGDAISLNCNVTGDPLPLVTWTKNGLPINYSKIRQHLHGDTLVIESATKFDSGVFVCNASNVLGSTSQNFTIIVYGEIICATNNELKILFNI-