Monarch geneset OGS2.0

DPOGS207309
TranscriptDPOGS207309-TA5382 bp
ProteinDPOGS207309-PA1793 aa
Genomic positionDPSCF300008 + 1446721-1459609
RNAseq coverage78x (Rank: top 65%)
Annotation
HeliconiusHMEL0071120.090.37% 
BombyxBGIBMGA012092-TA0.085.53% 
Drosophilaaxo-PC0.062.37% 
EBI UniRef50UniRef50_E2B6S30.064.44%Contactin-associated protein-like 2 n=8 Tax=Endopterygota RepID=E2B6S3_HARSA
NCBI RefSeqXP_394721.30.062.12%PREDICTED: similar to axotactin CG18296-PA [Apis mellifera]
NCBI nr blastpgi|3838529360.063.58%PREDICTED: uncharacterized protein LOC100875110 [Megachile rotundata]
NCBI nr blastxgi|3838529360.059.68%PREDICTED: uncharacterized protein LOC100875110 [Megachile rotundata]
Group
Gene OntologyGO:00055153.5e-07protein binding
KEGG pathwayhsa:260471e-69 
 K07380 (CNTNAP2)maps-> Cell adhesion molecules (CAMs)
InterPro domain[310-496] IPR0133203e-40Concanavalin A-like lectin/glucanase, subgroup
[324-486] IPR0089851.7e-33Concanavalin A-like lectin/glucanase
[120-256] IPR0126801.2e-16Laminin G, subdomain 2
[350-484] IPR0017914.7e-15Laminin G domain
[1104-1130] IPR0062093.5e-07EGF
Orthology groupMCL14695 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS207309-TA
ATGGGATTGCGGGTAAGACGAGGCTTATACGTCACCATCATACTCATATCCTATGGAATAGCTGATGTTGACTTGAGTTTGCCAGTAGATTTAGCAGTAGAAGAGAAGGCACCACAGCAAACTACCCTCCCTGCCACTTCCACCTCACCTCTCCCAACCACCATTAACGTGTCAACATCTACATACGCCACGATCATTCCTCCCATCCCACCCAGCGATCGAGACAGATTCCTGACTTTCGCCGAATCAGGCCACCACCAGACTTTTATGTTTGCCAAGGACAATACCTACATACAGCTTGACGGAGATATTATCCAACGATTTCAATTAAGATTGTGTAGAGAAATATCGTTTAAATTTAGAACAAGACTTCCCCACGGACTTTTAGTTTATCACAATGTTAAAAATCCAGTGTTCAAAATGCAGCCTTATGCATTATACGTTATAGTTGAGAAAGGTGAACTTAAAGTTGTACATGTTTTTGGTAAGCACTTAACCTCAGTGACAGTTGGCAGAGCATTGAACAGAGATCAGTGGCATAGCGTGGTTGTTACCATAGATGTGCATGGAGCAAGACTTATAGCCAAAGTAGACAATTTAAAAGAAGAGGTTTATCTAAAGGGACTAAGTTTCGATACTAATTATGGAATAACAGATAATTTAACTTCAGTTATACTTATTGGAGGTTTAAGTTCTGAAGAAAAATTACATGGAGTGAAATATATCATTGAATCATTCGTTGGTTGCATCAGCGATATGGTTCTTAGTTCCGGTAAAGCTGCCTCCGACCTTCTACCGATAGTACCGTTGATCGCAACAAAACACGAAAACGTTAAAGAAGGTTGCATAAACAAATGTAAAACCATGGAAAACCTTTGCTTCGAAGGTTCTAGATGTATCAACGAATACAATGGGTACAGATGCGATTGTTTTGGCACTCTATACGAGGAACAGCTATGTGATGTGTATACGGCGACAATATTAACACTGCGAGGGTCGAGCTACGTATCCTACCGCGTCTATGATTGGAAGGATCGCGTTCATTCTACAAACACAAGGGTCAGCTTGCATTTTAAGACACGTTTTGACGACTCCGCTCTTTTTTATGCAAGCGGTCAAATAGATGACAAACACCATTACATAGCACTGTCGATCCATCAGGAAAAAGTCGGTATACAAATAGATTTAGGCGATGGTCCAGTAGAGGATTATTTAGGAGTGAGAGTAAATAATAATATGTGGCACAATATAACTGTTATATTGCAAGAAAAAACAGTTCACGTATACCTCGACAATATAAGCGCAATATACGAAGTACCGGGCGACGCGAGATTTGTTTGCATCGACCCAGAAATATACATTTGTGGTGGTCCAGATTTGCACAAAATGAAAGGTCTGAAATCTTTCAACAATTTCGCTGGAAATCTCAAGTACGTTTACTACAATGATGTATCGATTTTGTATGAGTTGAAGCAGAATAATCCAAAGGTCCATTATATTGGAGTATTGGATCCCGAATTCGAAGATATAGATATAGAGCTAATCCCGATAACATATCCTTTTGCCACATCGCACATATGGTGGCCTTTGAATCAGAGTAATAGTATAAATCTGGTTTTCGATTTTAAAACTAGCAAGAATATGGCTGTACTAGCTTACAGTCAGATAACTTCGGGACAAGGTTATTTCGAGGTGAGAATGGTCAAAGAAGAAATTCGGTTCGAACTAGTACCGGATGTTGGTAAAAATGTAACAGTTCTAAAATCTGTTAAATTTAACGTAAGCAACGACTGGCATAATGTGGAATTGGATTATAGAAAAGGAAGAATCAAACTTACTGTGGACTATCATAACAAACACGCTCAGATGTTCGGTTTAGATTTTCAATTACAAGATAAGATTGTAATAGGTAGTGGACTGAAATCAGCTAATCTAGGTCTTATTGGGTGCATGAGAAATATCAAAATAAACGGCCTACAAATTGAACCACGATACGTTATAAATACGGAACGCGTGGTGGGTGAAGTGGCTATAGATGACTGCCGCTACGTGGACCCCTGCACCAGACCGAACACTTGTGAGCACGACGGTATATGCTCTATACGGGAGGACAGAGTCATATGCAATTGTGATAACACCGGCTACATTGGTGAGAACTGCCATTTTGCTACCTTCAGGAAGACTTGTGAAGAGTTAGCTCTCCTGGGCTATACACGTAATGACGTGTACTTGATAGATATCGATGGTAATGGAAAATTTCCACCAGCTCACGTGAAGTGCGAATTCCAGATCGAAGCGGATTCATCTACTACTGTGGTGGAACACAATTTACCAAGCCAAGTGGATGTAAGGTCTGCTCTAGGGCAAGACTTCAGCTTTCATATAAAATACAGAGAATTCACAGCAGAAATGCTGCAAGAATTAATTTCACACTCGCTGTTCTGCCGTCAATATATTAAATACGACTGCAACATGGCCCCTCTGGAACTACACAGTGCTACATGGTTTATATCATCCTCGAATGATACCGTCGATTACATTGGAAACGTTAAAAAGGGTTACTGTCCTTGTGGGGTTAACGCAACTTGTGTTAATCCAACAAAATCTTGCAATTGTGACGCAAACGAAAACAAATGGCACTCTGATGAAGGAACACTCGTTGATCCCAAAAGCTTAGGCATAACGGAGATGTTCTTTCTCCAACAAAAGGATTTAACCGAAGAAGCGCAAGGCAGGATTACGCTAGGTCCCTTAGAATGTGTTGAAACGAACACCCAGCGCTACGTTGTGACGTTCACAACATCGCAATCCTATATAGAGGTGCCCGGATGGCGAAAGGGGGATATCGCTTTTAGTTTTAGGACGACGGGTACAAGTGCGATTCTGTTATTCCAGCCGCCTATAAGACCGAATTATCCATCGTTCATGGTCGCTTTAACAAGTGAACACGAATTAACTTTCAACTTCACCCTCAACACGGGTACCACGCGGAAGTTGGTCATCAACTCAAAGAGAAAACTAAATGGAGGGGAATGGCATAAAATCTGGATCGACTACAACTTCTACCACGTGAGGTTCATGCTCAACACTGAGTACCAGATGCTTAATCTTTTATTGGAGGAGGAGTTCGGACCTTTTGAGGGTTCTATGTTCATTGGAGGGGCGACTGCAGAACATTTAAAGAAATCAGCTGTCAACCAAGGTCTCATCGGATGCTTTCGAGGCTTGGTTGTGAATGGTGAAATACTTGACATATACAGTTATATGTCTGTTCATTTATCTGAAATCATCAAAGACTGCAAGCCATCCTGCGTCCCTAATCCCTGCCAGAACAGAGCCACTTGTAAAGAACTCTGGTCCACATACGAGTGTATCTGCAAGAACCCGTGGGCGCACTTGGGTGAACATTGTGAAGAAAATATCAATGAAAAAGCATTGACTTTCCAAACTAAGGAGTCTTATTTGAAAAAAAACTACCTGGTCGATAACACGACTGACGCAGAAAAAGCAAGATTAAAAAAGATGATGATAGAAAACGTCCTAATGAATCTGAGAACTTACGACGACAATGCACTAGTACTGTACGCTAATGACAATCTCAATAATTTCATACATCTCTTCATACACAATGGAACGGAAATTATATATCTGTTTAATAACGAAGATGAAATCGTTAAAATGAATGTTACTTACGAGAAAATTAACAAAGGGGAAAGTGTTCAGATTGCAATCATAAGGACGGAGAACTCGACCACTTTGCATGTTAACGATAAGAATACAACTATAAATAAAGTTGCTAAACTTCTGTCTAATTACACGAACAAGCCGTGGAAGAATCCGGAGTTGGAGGTAATCCGACCTCAACGGCCTCCAGCGCCACCCACAGACTACTTCCAAATGAACCTGGGTGGCTATGACCAATACTCGCTTCATCTAGCATCGCAAGCAGAAAACTTCCCACAAGGAGGGTACGTTGGCTGCGTAAGAGGATTTAAAATCGCCGACCACGTAGTAGATCTGTCTAAAAAGGCACAGCAAAATATTGATCAAGATTTAACAGGTGTACTACCAGAATGTAATATGAAATGTGACTCCGAGCCATGTAAAAATGGCGGTATTTGTACCGAAGACTTCACAAACCAAGAGAGCAGCTGTGATTGTGAATTAACAAGTTATTTTGGAGAATACTGTATGGAAGAGAAAGGAGCAGATTTTAATGGCGAGAGCATTTTACAAAGGAAATTTGTTAAAATAAAGTTGGCATTCTCCAGCAACGACCTTCGCCAAAAGAACACAGTTTTATTGCTTGTGCAAACAGAAAACAAACGCAGCTATTATCTTCTGGTAGCAATAACACAAGACGGTTACTTAAAATTCGAAGAAGATCGCGAAGATTCTGCGTATGGAGTAGAATTTAAGAACAGAAACTTTTTAAACGGCGCCCGGCATACAGTATATTATACAAGGTCAGATGACGAAGCGAAACTCTTAATAGACAGAATAGAAGTGCCATTAGAGAAGTTACCTCCACAAGATCTGTGGAAGGTGTTTGACGTTGGATCTAACGAAGTACAAATAGGAGGACTCAATACTACCGATCCACGGCTTAAAATATACAAGGGTTACAATGGATGCCTCTCTAATATTTTCGTGGAAATAAACGAGCACGTTATGAAACCTCTAGAGGAGTATATGCTTTTCACGCGTTCTGACTCAGAAAAGGTAAACGCAGTCAACGCTCAGGGTGTGAGGAGCGCGCAGTGTTCCGCGGACTTTGATGAAGCGTGGCCTGAGCACGATCAGCTTGGCGCTACACATAACGGCAGCTTCCTTATCAGTGTAGATAAGACTTGGGTAGAGGATCCACCATCCCGCCTGCCCTACGATTCTCTGCACCAGCAACCAGACACTGAGGAAGAGAATACAGACAAATTCTTTATAGCACTCATAGTAATATTCTTATTGGGGCTCTGTTACACAGCGTCGCACCTAAAAGAAATAGAAAATGGCGACAAAAAGGCAAATGGAGTCGTGATAGACTTAGTTCCTACAATAATCGTGGAAGTGAATGAAGAAAAACCTCCGAGCAGGAGAGGTTCACTTCGTTTTCGAGATATGGTAGATAAAGATATAGCTTGGCAACCTCTTGAAGAAAAGGATGAAATTTTAGAAAACGAAGAAGAAGAAGAAGAAGACGAATCAGAAAAGCAGGAACAAAATACTTCAGAAGAAAGCGAAGAAAATAATACAGATAACGAAGATGATATCTCGGATCATTTTGAAAATGAATCAATTGACGCTGTCAACGTTGTAAGGAAATTGTCAACGCTGTCGAATAAAATAAGCGAAGAAAATTCTATAGAGTTAACTTCGAGTGCATAA

Protein sequence:

>DPOGS207309-PA
MGLRVRRGLYVTIILISYGIADVDLSLPVDLAVEEKAPQQTTLPATSTSPLPTTINVSTSTYATIIPPIPPSDRDRFLTFAESGHHQTFMFAKDNTYIQLDGDIIQRFQLRLCREISFKFRTRLPHGLLVYHNVKNPVFKMQPYALYVIVEKGELKVVHVFGKHLTSVTVGRALNRDQWHSVVVTIDVHGARLIAKVDNLKEEVYLKGLSFDTNYGITDNLTSVILIGGLSSEEKLHGVKYIIESFVGCISDMVLSSGKAASDLLPIVPLIATKHENVKEGCINKCKTMENLCFEGSRCINEYNGYRCDCFGTLYEEQLCDVYTATILTLRGSSYVSYRVYDWKDRVHSTNTRVSLHFKTRFDDSALFYASGQIDDKHHYIALSIHQEKVGIQIDLGDGPVEDYLGVRVNNNMWHNITVILQEKTVHVYLDNISAIYEVPGDARFVCIDPEIYICGGPDLHKMKGLKSFNNFAGNLKYVYYNDVSILYELKQNNPKVHYIGVLDPEFEDIDIELIPITYPFATSHIWWPLNQSNSINLVFDFKTSKNMAVLAYSQITSGQGYFEVRMVKEEIRFELVPDVGKNVTVLKSVKFNVSNDWHNVELDYRKGRIKLTVDYHNKHAQMFGLDFQLQDKIVIGSGLKSANLGLIGCMRNIKINGLQIEPRYVINTERVVGEVAIDDCRYVDPCTRPNTCEHDGICSIREDRVICNCDNTGYIGENCHFATFRKTCEELALLGYTRNDVYLIDIDGNGKFPPAHVKCEFQIEADSSTTVVEHNLPSQVDVRSALGQDFSFHIKYREFTAEMLQELISHSLFCRQYIKYDCNMAPLELHSATWFISSSNDTVDYIGNVKKGYCPCGVNATCVNPTKSCNCDANENKWHSDEGTLVDPKSLGITEMFFLQQKDLTEEAQGRITLGPLECVETNTQRYVVTFTTSQSYIEVPGWRKGDIAFSFRTTGTSAILLFQPPIRPNYPSFMVALTSEHELTFNFTLNTGTTRKLVINSKRKLNGGEWHKIWIDYNFYHVRFMLNTEYQMLNLLLEEEFGPFEGSMFIGGATAEHLKKSAVNQGLIGCFRGLVVNGEILDIYSYMSVHLSEIIKDCKPSCVPNPCQNRATCKELWSTYECICKNPWAHLGEHCEENINEKALTFQTKESYLKKNYLVDNTTDAEKARLKKMMIENVLMNLRTYDDNALVLYANDNLNNFIHLFIHNGTEIIYLFNNEDEIVKMNVTYEKINKGESVQIAIIRTENSTTLHVNDKNTTINKVAKLLSNYTNKPWKNPELEVIRPQRPPAPPTDYFQMNLGGYDQYSLHLASQAENFPQGGYVGCVRGFKIADHVVDLSKKAQQNIDQDLTGVLPECNMKCDSEPCKNGGICTEDFTNQESSCDCELTSYFGEYCMEEKGADFNGESILQRKFVKIKLAFSSNDLRQKNTVLLLVQTENKRSYYLLVAITQDGYLKFEEDREDSAYGVEFKNRNFLNGARHTVYYTRSDDEAKLLIDRIEVPLEKLPPQDLWKVFDVGSNEVQIGGLNTTDPRLKIYKGYNGCLSNIFVEINEHVMKPLEEYMLFTRSDSEKVNAVNAQGVRSAQCSADFDEAWPEHDQLGATHNGSFLISVDKTWVEDPPSRLPYDSLHQQPDTEEENTDKFFIALIVIFLLGLCYTASHLKEIENGDKKANGVVIDLVPTIIVEVNEEKPPSRRGSLRFRDMVDKDIAWQPLEEKDEILENEEEEEEDESEKQEQNTSEESEENNTDNEDDISDHFENESIDAVNVVRKLSTLSNKISEENSIELTSSA-