Monarch geneset OGS2.0

DPOGS204151
TranscriptDPOGS204151-TA2568 bp
ProteinDPOGS204151-PA855 aa
Genomic positionDPSCF300034 - 759035-802094
RNAseq coverage11x (Rank: top 84%)
Annotation
HeliconiusHMEL0096708e-7442.69% 
BombyxBGIBMGA005033-TA8e-13074.11% 
DrosophilaCG14372-PC8e-10833.81% 
EBI UniRef50UniRef50_E2AKZ87e-13838.46%Protein turtle-like protein A n=7 Tax=Formicidae RepID=E2AKZ8_CAMFO
NCBI RefSeqXP_975559.29e-16940.00%PREDICTED: similar to CG34113 CG34113-PP [Tribolium castaneum]
NCBI nr blastpgi|1892395172e-16740.00%PREDICTED: similar to CG34113 CG34113-PP [Tribolium castaneum]
NCBI nr blastxgi|1892395171e-16640.33%PREDICTED: similar to CG34113 CG34113-PP [Tribolium castaneum]
Group
KEGG pathwaygga:4282532e-12 
 K06491 (NCAM)maps-> Cell adhesion molecules (CAMs)
    Prion diseases
InterPro domain[202-284] IPR0137837.9e-15Immunoglobulin-like fold
[202-284] IPR0131622.1e-11CD80-like, immunoglobulin C2-set
[33-147] IPR0035994e-08Immunoglobulin subtype
[35-146] IPR0131062.3e-07Immunoglobulin V-set
Orthology groupMCL10437 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS204151-TA
ATGTGCTCCATGAAGTGGCTCCGACTGCTTGTGATGCTGGCTGTCTGCTGTGCTCAAGAAGGAGAATGGAACGAAGATGCGGAGGACTTAGTGTCAACAGTGGATGTAGATGCCGTGCTGGGGCGTACAGCATCGCTTCCCTGTGACGTTACTCCAGATACAAATGAGGATAGAGTATACATGGTGCTCTGGTTTCGAGCAGGGAAGAGTACCGGCGGAAAACCTATTTATAGTTTTGACGTTCGTGGAAGATCATTTAACAAAGCTTTACAATGGTCAGATCCTAATGTTTTTGGTCCACGGGCTTATTTCGCCACCGTTGCCAGACCGGCCTCTTTGACGCTTGACACTGTCCAATTGGATGATGAAGGAGTATACCGCTGTCGAGTCGACTTTAAGAATTCTCCCACACGCAATTTCCAAATCAGGTTGTCCGTTATTGTTCCACCACATCAACTGATACTATACGACAAGTCTGGGCGAGATGTTTCGGGCGTGGTGGGTCCGCTCGAAGAGGGCAACGAGCTTGTCCTCGTTTGCGAGGTCAGAGGAGGAGATTACACAGAAGAAATATTTTCTCTTGCCCTCCTGGAAAAGAACGCTCCTGTGATACAGAACAAACCAGTGACGTTATCTTCGGAACGCTACGTCAGCTTATCGTGTGTGTCAGAGGGAAGCCGCCCGCCCGCGCAGTTGACCTGGTTCAAGGACAATAGAAAATTTAAAAGAGGAAAAATAACAGATGCATCTAATGACACTTGGGTGAGCAGCACGCTGCAATTTATGCCACTTCCAGAAGACGATGGCGTCCAAATAAAATGTCAAGCAGATAACAACGCGCTTCCCGGACAAAGCATAGAAGATTCTTTCAAACTGGATGTTGTTTCGCCTAAAAATGAAAATTATAAATTGCATAACGTGGAAGATGAACAACCTACTTATGGCTTGTCGGATAAATTTTACAAATACCTATTCTGTAAAATAAATGTTACAAATATTATTGCTCAAGATATGGATGCATTACCACTTGAGGTCAAAATCTCGGAAAAACCAGTACTCCATAACGTGGTTTCTGGCATCATTGTCAGCACCAAGTCATTAGTGCTACAAAAGGTTACAAGGGACTATAGCGGAGACTACTCTTGTCGTGCCACCAATGCCCTCGGAGAAACAGCGAGCCAAGCTACTCATCTTAGTATACAATATACACCAGTTTGCACGCACACTTCACCTCAAGTGTTAGGTGCACAGATAGACGAAGCCCTACTTATACGCTGTTCTGTTACCGCCAATCCCCCAGATGTCACCTTCTTTTGGCAGTTTAATAATAGTGGGGAGAGCCTAGACGTATCCCCGACTAAGTTTGGCACAGCCAACGGCAGTACAAGCGAGTTAAGCTACAAGCCTCAAAGCGAACGCGACTATGGAGCTCTGAGTTGCCGCGGTACAAATTCAGTCGGTAGACAAGACGAGCCATGTGTATTTCAAATAGTACCTGCATCTCGCCCAGCCCCACCTAAAAACTGTTCTCTTCACACCGGATCCAATAGTTCTGAGGGGCTGAACTGGTTGATGGTACGCTGTGTTGCTGGATATGACGGAGGTCTACCTCAAACCTTTGTGCTGGAAACACTAGATCCCATCACTAGCAAGACTAAGTTCAATAGCAGCGCTAACGATACAGATGGTTTAGCTACTTTCAAGTTAGATCTCTCGCAAATATCGGCTGGCGAGACCGAGACCACATTTAATCTACTCATCTATGCAAGGAACCTCAAGGGAGATTCGGAGAAAACTCTGCTGGAAAATATTGCTTTCAATGACGCCGCCAGGAGAACGGATGGCAAAAATGTATTGGGAGGAATAACGTTTGGAATGGTAATTGCTGCGTCACTTGGAGCCGTGTTTGCCGTTGGAGGAATAATTTTCGCCGCGCTCTGTGCACGTCGAAAGAGATCTCATCCAACTCATAAACATCCTCCAGGTGATATGCTGGAGTTAAGTGATGGGTGCAGAAGATATGTTGTAGCATACACTATCAAACCATCGCAAGAACTTAAAACGCCTGATCCACAGCCAGATATTTTAAATCCACCAGATGGCGAAAGTCAAAAAGCGCCAGCTTCGACGGTTGAAGCCGATGAGTGGCCAAGTGTAAAAGAAGTTAGAGGGGACTGGAATAAAACAGGAGCTGTCTTTTCAGCAGAGGACCTTGCACTTTTAGATTCAACTGGACATCCACAAGAGATAACTTCAAGAAATGAATTGGTTCATAACAGTCGGCAGGAAGACGCTCTTAATCAGAATTTATCTCAACCTATATTAGCAAGTAATTTTAGGCCCAACTTTTTAGTAACAAATAGTCCAACATTAGGAAGTCCAAACTTTGTGTGTCCTAATGGTAATATTGGATCACCTTTTGAAAGTCAAACGCTCACACTAAGCGGACCAACTTTAAGTTCTAGTAGTCTGTCTAATTCTACGTTACATAGAAAAGGCAAGAGTAATACACGAAGAAGAGAACATGTTCTCGCAGAAAACTTGCCGGGTCCTGAGAGCTGTGTTTAA

Protein sequence:

>DPOGS204151-PA
MCSMKWLRLLVMLAVCCAQEGEWNEDAEDLVSTVDVDAVLGRTASLPCDVTPDTNEDRVYMVLWFRAGKSTGGKPIYSFDVRGRSFNKALQWSDPNVFGPRAYFATVARPASLTLDTVQLDDEGVYRCRVDFKNSPTRNFQIRLSVIVPPHQLILYDKSGRDVSGVVGPLEEGNELVLVCEVRGGDYTEEIFSLALLEKNAPVIQNKPVTLSSERYVSLSCVSEGSRPPAQLTWFKDNRKFKRGKITDASNDTWVSSTLQFMPLPEDDGVQIKCQADNNALPGQSIEDSFKLDVVSPKNENYKLHNVEDEQPTYGLSDKFYKYLFCKINVTNIIAQDMDALPLEVKISEKPVLHNVVSGIIVSTKSLVLQKVTRDYSGDYSCRATNALGETASQATHLSIQYTPVCTHTSPQVLGAQIDEALLIRCSVTANPPDVTFFWQFNNSGESLDVSPTKFGTANGSTSELSYKPQSERDYGALSCRGTNSVGRQDEPCVFQIVPASRPAPPKNCSLHTGSNSSEGLNWLMVRCVAGYDGGLPQTFVLETLDPITSKTKFNSSANDTDGLATFKLDLSQISAGETETTFNLLIYARNLKGDSEKTLLENIAFNDAARRTDGKNVLGGITFGMVIAASLGAVFAVGGIIFAALCARRKRSHPTHKHPPGDMLELSDGCRRYVVAYTIKPSQELKTPDPQPDILNPPDGESQKAPASTVEADEWPSVKEVRGDWNKTGAVFSAEDLALLDSTGHPQEITSRNELVHNSRQEDALNQNLSQPILASNFRPNFLVTNSPTLGSPNFVCPNGNIGSPFESQTLTLSGPTLSSSSLSNSTLHRKGKSNTRRREHVLAENLPGPESCV-