Monarch geneset OGS2.0

DPOGS214753
TranscriptDPOGS214753-TA4200 bp
ProteinDPOGS214753-PA1399 aa
Genomic positionDPSCF300022 + 783867-806958
RNAseq coverage138x (Rank: top 55%)
Annotation
HeliconiusHMEL0169460.079.09% 
BombyxBGIBMGA004748-TA0.078.94% 
DrosophilaNrx-1-PB0.059.85% 
EBI UniRef50UniRef50_Q7PSM90.060.83%AGAP004066-PA n=4 Tax=Endopterygota RepID=Q7PSM9_ANOGA
NCBI RefSeqXP_309618.40.061.22%AGAP004066-PA [Anopheles gambiae str. PEST]
NCBI nr blastpgi|3479711680.060.83%AGAP004066-PA [Anopheles gambiae str. PEST]
NCBI nr blastxgi|1571125620.053.85%neurexin [Aedes aegypti]
Group
KEGG pathwaynvi:1001208510.0 
 K07377 (NRXN)maps-> Cell adhesion molecules (CAMs)
InterPro domain[635-812] IPR0089853.1e-45Concanavalin A-like lectin/glucanase
[230-420] IPR0133203e-40Concanavalin A-like lectin/glucanase, subgroup
[241-383] IPR0017915.4e-39Laminin G domain
[665-790] IPR0126803.6e-29Laminin G, subdomain 2
Orthology groupMCL10461 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS214753-TA
ATGAGAAAGTTTGAGAAAAACAAAGCTCCTTCGGAGGCTACGTTCCGAGGAGCCGAATTCCTCTCGTACGACCTCACTCAAACCGGCGGCGAACCCATAGTTAGCACTCAGGATACAATCTCACTCTACTTTAAGACGAGACAACCAAATGGACTCCTGTTTTACACTGGTCATGAAGCTGATTACCTGAACCTTGCTGTACGGGACGGTGGCGTCTCCCTCACCATGGGTTTGGGGAACGGGAAGCAGGAGATGCACATAAAGCCGAGCAAAACACGCTTTGACGATCACCAGTGGCACAAACTGACAGTCAGGAGAAAGATACAGGAGATTACACCGTTCACCAGCTTCTGTCGGGTGTCAGCTGTAGTAGATGACGTGTACTCAGAGCATTCCCACGTGGCTGGATCCTTCACAATGCTGGCCAGCTCGAGGGCCCACGTTGGAGGCTCCTTAAATGCACGAGCACTTCCAGGAGCCAGGGTTCATACAAACTTCATAGGGTGTCTAAAAAAGGTTGAGTTCTCAGCGGACACGCTTCGTCTGAACCTCATAGATCTAGCTCGCACCGGTAGTAAGTTAATCACTGTGACTGGACGCCTGGAGTACGTTTGCACAGCCACTGATGCAGCCGATCCTGTCACCTTCGCTACCAGGGATGCGCATCTGTTATTTCTACGATATATTTACAAAAATATCGAACCGAAATGGGAAGCTGTGAAGACCGGGACGATATCGTTCAAGTTCCGTACGAACGAACCGAACGGCTTGATTCTGTTCAACATGGGCGCGAAGCCACCAAGGGCGGACCTGTTCGCGGTTGAGATTCTTAATGGGTACGCATATGTGCATGTGGATCTTGGATCAGGTGGGGTCCGAGTGAGGGCATCTAGGAGGAGGATTGATGATTCACATTGGCATGAGTTCCTATTGAGGAGGACAGGCAGGGATGGGAAGGTTACCGTTGATGGGGCCAATGCGGAGTTCAAAACCCCTGGTGAATCCAATCAGTTAGAGCTCGACGGGCCGTTATTCGTGGGCGGACTGGGCTCGGAGTACTCCGCTTCCAGGACTCCAGCCGCTGTATGGACTGCGGCCTTGAGACAGGGTTTCATCGGATGTATCAGGGATTTAGTACTGAACGGAAAACCACAGGATCTGACAGCATTTGCCAGACAACAGGATTCTGCGTCTGTCCGTCCTGCGTGTCATGTGCTGATGAAGCAGTGTGCGAGCGCTCCCTGCCAACACGGAGCCCCCTGCTCTGAAGGATGGAACAGACCTCTGTGTGACTGTTCGGGGACCAACTATGGTGGACCTACTTGCGGGCGAGAGTCTCCCACGATATATTTGAATGGCAGCCAGCACTTAACGGCGTCCCTGGGGGCGGAGCATGTGACGCAGACGGAGGAGTTACTGCTGAGGTTCCGGACCACCAGAGCGGGCCTTCTGCTGAGGACTGCCTCCGAGCACTCCGCTGACAGGATTGAACTGGCCGTGGCCGCTGGCAGGGTGCGAGCCAGTGTCAGACTGGGAGACAGAGAAAAGAACCTCCTTGGCGGCAGCAGTGTTTGGGATGACCGTTGGCACACGGTCCGCTTCAGTCGGCGCGCCTCCAACCTGAAGCTGCAGGTGGACACCGAAACAATTTTAGGAAAAGCGTCGACCCTTGAGATCTCTACGTTACATTTGGGAGGACTGTTCCATCCTGAAGAGGAAATCCAAATGACGTCGACGTTACCGAACTTCGTTGGATATTTGCAGAAGTTTGTTTTCAACGGAATAAAGTATATAGACCTAGCCAAAAGCCTCGGGATAGGGAGTGGTGACGAACACAACGACATAGATGACACTTCGAATATTATATTTACTGGGAAGTTCGTCAAACCAGACTCTTTGAATGTCTATAAAGCCGTGACATTCAAATCGAAGCACACATATGCGGGGCTACCATTGCTCAAAGCGTACGGAAACACATATTTGGATTTCTATTTCCGTACGACAGAAATGGACGGTCTGTTGTTCTACAACGGGGGGAAGAAACAAGACTTCATAGCCATAGAACTGGTGAACGGTCATGTTCACTGCGTGTTCAACCTCGGTGATGGTGTGGTGACCATGAAGGATAAGCTGAAGACGTTTGTTAATGACAATCGTTGGCATACAGTCTCTATACGGCGACCGACTCCTAAAATACACACCATGCAAGTCGATGATGACGTAGAAATGCATACCACGAGTTCAAACTTAATGCTTGAGTTGGACAGCGTGTTATATGTTGGGGGGGTGCCAAAGGAGATGTACACTTCACTTCCGGTGGGAGTGCTGTCACGGCAAGGGTTCGAGGGTTGTATGGCGAGTCTGGACCTGCCCGGGGAATCTCCGTCCTTGATCGAAGATGCAGTAGTACCTAGTTCATCACTGGTGTCGGGGTGTGAAGGGCCCACGAAGTGCACTCATAATGCGTGCTCAAATAAAGGAGTCTGCGTTCAACAATGGAACACGTATGTGTGCGACTGCGACCTTACGTCTTTTACTGGACCAACTTGTTATGACGAGTCGATAGCGTTTGAGTTTGGCCCCGGGCGCGGCGTGTTGACCTACACGTTCCCGCCCGGTGCCCGCGCGGACACGGACAGTGACCGGGTCGCGGTCGGCTTCCTCACCACTAAGAGTGACGCTGTACTGGTCAGGATCGACTCTGCGGACACACAGGACTACATGCAGATGGAAATTATCCGAGGTAATTTGTTCACTGTCTACAACGTCGGTGACGGAGAGCATCCTCTGGGTGACGAGTCAGCGCGTGTAGATGACGGCGTCTACCACGTGGCCAGATTCACCAGGAATGGGAGCAGGGCGGCGCTTCAACTAGACGATTATGCTGTTAATATCCGACACCCTCAAGGAGGGCAGCAGTCCACCATTTTCAATAGCATGTCCCGTGTGACGGTGGGTGGTGGTAGTGGTGGTGCGAGATCGTTCGCGGGAGTTGTTGCTGGTCTGGTGGTCAACGGGGCAAGGGTGTTAGACCGGGCCGCCGCCGCTGACCCGGCCGCAGCACTCAGGGGGGACGTGAGACGAGCCGCTGGACCTCTGGACCGTGACCTTAACAGAATGCAGCAGACGCCTCCATCGGGATACGGTGGTCCGGGCGCTCCAGATGAGTTGGTGTATTCCGGCGCTGGTTCCGGATGCCGTGATGATGACGAGGACGCCTGTGTATTACCTGACGCAGGCTCTGGCGATGATCTTATCACGCCAGTCTATGTTCCCTCCACCAGAAGACCGCCCGCCAAGATGCATAAGGGTGACGTGAGTGGCAAACTGATGAAACCCTGCGATGATGAGGATTGCATCGAAGGCTCGGGCTCTGGCGGGGATGACGTCACCGAGCCCGAACATCCTTCTACCACGAAAATTATTGAGTATCCCAAACAAAAAACCAGCGGCAGCAGCACCCACGCGATGACGTCACCTACGGCCGCTGTGTCCACGGATCACGACAGGTCCGTGGGATCCACGGGTTCAGAGGGATCCACGGAGATGACGAGACGAACAGATCCCGAGACCGAACATCCCACGACCATCCACGATCACGACAACATGCACGCCGGGACACTCGACACGGCCACCACAGACAAGGGCACCACGCCCACCGATAACAGAATAGATGATCGCACACACACCTTCACACATACAACGGACCACGAGCCGACGTATGAACACACGGACTACGAGACGCACACAGACTCGCGACCAACACACGACGGTAACGAGAGGAACAGGATTTCGGACTACAACAACGAGCCAGAGATCACCACCAAATGGTACCACCCGAAGACCACCGATAACAGAGTCGTGCCGCCTGAGTCAGAGTTCTTCGCTACCATCGTTGGCATAGTGGCGTCCGTGTTGATAGCGATCATTCTGATAGTGATAATAATATTGAAGCTACTATTCCGTCTCGATCCTTCCTACAAAGTCACAGAAGATAAAAGTTACCAGCAGAGCGCCAGCGCTGCACTGTTAGGAAATCAGGCACACTCCGGCTACCAGGCGGCGAGTGGTATGAACGGTGGGGGTGGCAGCAATGGTAGCGGAGGTAACGGTGGGGGGGCGCGCTCCTTACAGCCGTTACCTCTCAACCGAGCGCCGCAACCTGTGAAGAGAGACGGCATCAAAGAGTGGTATGTGTAG

Protein sequence:

>DPOGS214753-PA
MRKFEKNKAPSEATFRGAEFLSYDLTQTGGEPIVSTQDTISLYFKTRQPNGLLFYTGHEADYLNLAVRDGGVSLTMGLGNGKQEMHIKPSKTRFDDHQWHKLTVRRKIQEITPFTSFCRVSAVVDDVYSEHSHVAGSFTMLASSRAHVGGSLNARALPGARVHTNFIGCLKKVEFSADTLRLNLIDLARTGSKLITVTGRLEYVCTATDAADPVTFATRDAHLLFLRYIYKNIEPKWEAVKTGTISFKFRTNEPNGLILFNMGAKPPRADLFAVEILNGYAYVHVDLGSGGVRVRASRRRIDDSHWHEFLLRRTGRDGKVTVDGANAEFKTPGESNQLELDGPLFVGGLGSEYSASRTPAAVWTAALRQGFIGCIRDLVLNGKPQDLTAFARQQDSASVRPACHVLMKQCASAPCQHGAPCSEGWNRPLCDCSGTNYGGPTCGRESPTIYLNGSQHLTASLGAEHVTQTEELLLRFRTTRAGLLLRTASEHSADRIELAVAAGRVRASVRLGDREKNLLGGSSVWDDRWHTVRFSRRASNLKLQVDTETILGKASTLEISTLHLGGLFHPEEEIQMTSTLPNFVGYLQKFVFNGIKYIDLAKSLGIGSGDEHNDIDDTSNIIFTGKFVKPDSLNVYKAVTFKSKHTYAGLPLLKAYGNTYLDFYFRTTEMDGLLFYNGGKKQDFIAIELVNGHVHCVFNLGDGVVTMKDKLKTFVNDNRWHTVSIRRPTPKIHTMQVDDDVEMHTTSSNLMLELDSVLYVGGVPKEMYTSLPVGVLSRQGFEGCMASLDLPGESPSLIEDAVVPSSSLVSGCEGPTKCTHNACSNKGVCVQQWNTYVCDCDLTSFTGPTCYDESIAFEFGPGRGVLTYTFPPGARADTDSDRVAVGFLTTKSDAVLVRIDSADTQDYMQMEIIRGNLFTVYNVGDGEHPLGDESARVDDGVYHVARFTRNGSRAALQLDDYAVNIRHPQGGQQSTIFNSMSRVTVGGGSGGARSFAGVVAGLVVNGARVLDRAAAADPAAALRGDVRRAAGPLDRDLNRMQQTPPSGYGGPGAPDELVYSGAGSGCRDDDEDACVLPDAGSGDDLITPVYVPSTRRPPAKMHKGDVSGKLMKPCDDEDCIEGSGSGGDDVTEPEHPSTTKIIEYPKQKTSGSSTHAMTSPTAAVSTDHDRSVGSTGSEGSTEMTRRTDPETEHPTTIHDHDNMHAGTLDTATTDKGTTPTDNRIDDRTHTFTHTTDHEPTYEHTDYETHTDSRPTHDGNERNRISDYNNEPEITTKWYHPKTTDNRVVPPESEFFATIVGIVASVLIAIILIVIIILKLLFRLDPSYKVTEDKSYQQSASAALLGNQAHSGYQAASGMNGGGGSNGSGGNGGGARSLQPLPLNRAPQPVKRDGIKEWYV-