Monarch geneset OGS2.0

DPOGS206372
TranscriptDPOGS206372-TA3837 bp
ProteinDPOGS206372-PA1278 aa
Genomic positionDPSCF300192 - 221056-243236
RNAseq coverage1921x (Rank: top 6%)
Annotation
HeliconiusHMEL0090190.079.85% 
BombyxBGIBMGA014065-TA0.064.33% 
DrosophilaNrg-PG0.058.61% 
EBI UniRef50UniRef50_P202410.057.82%Neuroglian n=48 Tax=Pancrustacea RepID=NRG_DROME
NCBI RefSeqXP_002430407.10.059.92%Neuroglian precursor, putative [Pediculus humanus corporis]
NCBI nr blastpgi|2666345340.083.08%neuroglian [Mythimna separata]
NCBI nr blastxgi|2666345340.083.08%neuroglian [Mythimna separata]
Group
Gene OntologyGO:00055157e-12protein binding
KEGG pathwaygga:3962023e-169 
 K06756 (NRCAM)maps-> Cell adhesion molecules (CAMs)
InterPro domain[437-531] IPR0137838e-23Immunoglobulin-like fold
[906-1035] IPR0089573.6e-21Fibronectin type III domain
[438-523] IPR0130981.1e-16Immunoglobulin I-set
[357-423] IPR0035981.4e-13Immunoglobulin subtype 2
[926-1009] IPR0039617e-12Fibronectin, type III
[533-618] IPR0035994e-11Immunoglobulin subtype
Orthology groupMCL10464 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS206372-TA
ATGAAGCACAGAAGCAACATGGAAACCAGTACAGTGATAATACTCTTCGCGATTTTGTCGCAAACGACAGCGCTGCTTTATAACACAACAGTGACATCACCACCGAGGATTGTGAAGCAGCCGACGGTTGAAGAATTGCTGTTCCAAGTGGCACAGCCGGGGGAGGTGGACAAGCCTTTCATCATCGAATGTGAAGCCGAGGGAGAACCCGCACCCAAGTACAGATGGATCAAGAACGGCAAGTCGTTCGAGTATACGAGTTACGATAACAGAATATCCCAGCAGCCTGGACGGGGCACGCTGGTGGTCAGTCAGCCGAGGAACGAGGACTTGGGACAGTACCAGTGCTTCGCCTACAACGAGTGGGGCACGGCTACATCCAACTCGGTGTTCGTGAGGAAGGCGGAGTTGAACTCCTTCAAGGAGACGGACGGACCTCAGAAAACGATCACTGCTCAAGAAGGCCTACCGTTCAAACTGACCTGCGAACCGCCGGACGGCCATCCAAGCCCTAACGTGTACTGGATGCTGCAAGGGGAACAGGGCCAGCTGAAGACTATCAACAATTCCCGAATGACCTTAGACCCCGAGGGCAACCTCTGGTTCTCGAACGTGACCAGGTTCGACGCGAGTGTGGACTACGCCTACACGTGCGCCGCTAAATCCGTGTTCAGGAACGAGTATAAGCTCGGCAATAAAGTGTACTTGCAAGTTCAGCAGACTGGAATCTCACCGACGTCGAACAAACACCAGCCCGTGTTGCAATACGCGACCAGGAGGGTCGAGAAGGCTTGGAGAGGGAAGAAGGTGGAGCTGTACTGCATATACGGCGGGACGCCGCTACCTCAAGTGGTGTGGAAGAAAGAGGGTCGAACGATAATATCCTCGCAGCGGATAACTCAAGACAACTACGGCAAGACGCTCGTCATCAAACGTGCCGGCTACGAGGACCAGGGGACGTACACCTGCGAAGTCAGCAACGGAGTCGGAACAGCCGAGACCTACTCGATACAACTCAACGTTGAAGCGGCTCCGTTCTTCATCGAGGAGCCCCAGTTCCAGAACCTGGCGGAGGGCGAGACGGCCGTCATCCAGTGTAGGGCGGGCGGCACACCAGAGCCGACCATCTCGTGGGTCCACAACGGGAAGCCCATAGAGCAGGCCGAGAACAACCCCCGCAGGCAGGTCACCGCCAACAGCATCACCATCACCGACCTCGTGAAGAAAGACACCGGCAACTACGGGTGTAACGCGACCAACTCGAACGGTTACGTCTACAAGGACGTGTTCATCAACGTGCAGTCCATCCCTCCTGAGATCAAGGAGGGTCCGGACAACCTGACGGTGGTGGCGGGGTCCACGGCCGCTCTCAGGTGTCGCGTGTTCGGAGCACCCCGACCCATCGTCAGGTGGATGAGAGACGACATCGACGTAACTGGCGGCAAATACAACATAACGAAGGAAGGTGACCTGGTGATAGCCGATGTTTCCTTCACCGACGTCGGCACGTACCAATGTTACGCCAAGAACAAGTTCGGAGAGAAGTCCGCCTTCGGATCGCTCACTGTTAAGAAACGCACAGTGATAACTGACAAACCCGAACCGTACGAGGTCGCGGCGGGGTCGTCGGCGACCTTCAGGTGTAACGCGAACGCAGACGACTCCCTCAAGCTAGACATCATATGGCTGAACGACGGACAACCGATAGACTTCGACAACCAGCCGCGGTTCCGGATCACCAACGACTACTCGCTGCTCATCTCCGACACCAGCGAGCTGGACTCCGGAGAGTACACCTGCATAGCGAAGACCGCCATCGACGAGGCGAGGGCCCAGGCCACGCTCACCGTACAAGATAAACCGAACCCCCCAGCTCTAGAGGGCATCGAATGTCACGAGCGCACCGCGACTCTCCGCTGGCACTCGATGGGTGACAACCGCGCTCCCATCCTCCGGTACTCCATCCAGTATAACACGAGCTTCACGCCTCACGCCTGGGACCTCGCCTCCGACAACGTCCCCGCCATCGACACCTCGTGGACGGTGCAGCTCAGCCCCTGGGCCAATTATACGTTCAGGGTGACGGCGTCCAACAAGATCGGCGCCTCGGCTCCGTCGTCTCACTCGGACGTGTGCACCACGCAACCAGACGTGCCTTACAAGAACCCTGACAACGTGGAGGGGAAGGGAACGGATCCTACCAACATGGTCATCACTTGGTCGAAAATGCCTCAAATCGAGCACAACGGTCCTGGATTTTACTATCTAGTGTCGTGGAAAAGGAACATTTCAGGGGCGGCGTGGAGCGAGGAACAAGTGCGGGACTGGGCCACGACGGAGTACGTCATACCAAACACACCCACCTTCAAACCTTACAAGATCAAGGTTATTGCTGTCAACTCGAAGGGAACATCAAACGTCGCTCCGGTCGAGGTCATAGGTTGGTCGGGCGAGGACACTCCTCTACAGGCTCCCACTAACTTCACTTTAGTACAAGTCACCACTGGAACCACGGCTCTGCTGTCCTGGAACCCGGTACCGGCCGAGTCGGTGAGGGGACACTTCAAGGGTTACAAGATACAGACCTGGATCGACGGCGAGGAGGACCGGCCCAAGGAAATCCTCATCAAAGCAGATGCCTCCAGCACACTGGTCACCAAATTCAAGCCCTTCAAGAAGAATAACGCTCGGATCCTCGTTTACAACGGAAGGTTCAACGGACCGCCCAGCGAGACTCTCAGCTTCAACACACCAGAAGGCAAGCCCGGCACGGTGCGATCCTTCGAAGCTTATCCAATAGGCTCCTCGGCCATGCTTTTAAAATGGGACAAACCTTTAGACGAGAACGGTATTCTGACCGGGTACAAGATATACTACCAGAAGGTGACCGGCACGGCGCTCGGGCCGCTGCAGGAGAGGAAGAAGGAGATCGATCCCAAGTTCGACCGCGCCAAGCTGGCGGGGCTCGAGCCCAACACCAAGTACAGAATAGAGATACGGGCCAAGACGAAGGCCGGGGAAGGGGACAAATATTACGTGGAGCAGACCACCAGGGCCATGGTCACGGCCGTGCCGGACGTGCCGGTGTTCGAGACCAGCACTCTACCCGCCAAGGAAGGAACGGCGCACATACTGGTCAGGTGGATACCTTCCCTGGAAGGACATGCGGGGACGCACTTCATCGCCTGGTATAGACTGAGGGGAAGACCCGACTGGCTGCAGAGCAACGAGGTCACAGAGGACGACTACGTCATACTGACGGGACTCGAACCCGGACTCGAGTACGAAGTCAAGGTCACCGCTCATGACGGAGACTACTACAGCACCAGCGACGTTAAGGTCGTCGACACAACCATCGACGGCCCCCACGTCCGCCCACAAGAGGCGACGGCTACCGCGGGCTGGTTCATCGGCGTGATGGTGGCGCTCGCCTTCCTGCTGCTGGTGCTGGTGCTGGTGTGCGTCGCGCGCCGCAACAGGGGCGGCAAGTACGACGTGCACGACAGGGAGCTCGCGCACGGGCGGGCCGACTGCCCCGACGCCGCCTTCCACGAGTACACGCATCCATTGGACAACAAGTCGCGGCACTCCATGAGCAGCGGCACCAAGCCCGGCCCCGAGAGCGACACGGACTCCATGGCGGAGTACGGCGAGGGCGAGACAGCGGGCATGAACGAGGACGGGTCCTTCATCGGTCAGTACGGCCGCAAGCGCCGCCCGCCGCCTGGTCGCTTCACCGAGGACGGTTCTTTCATCGGTCAGTACGTGCCTGGTGCGCGCGTGTTGGCTCCTCCCCCTCCCGCGCCCGCGCCCGCCGCCCCGCCCACATACGTGTAG

Protein sequence:

>DPOGS206372-PA
MKHRSNMETSTVIILFAILSQTTALLYNTTVTSPPRIVKQPTVEELLFQVAQPGEVDKPFIIECEAEGEPAPKYRWIKNGKSFEYTSYDNRISQQPGRGTLVVSQPRNEDLGQYQCFAYNEWGTATSNSVFVRKAELNSFKETDGPQKTITAQEGLPFKLTCEPPDGHPSPNVYWMLQGEQGQLKTINNSRMTLDPEGNLWFSNVTRFDASVDYAYTCAAKSVFRNEYKLGNKVYLQVQQTGISPTSNKHQPVLQYATRRVEKAWRGKKVELYCIYGGTPLPQVVWKKEGRTIISSQRITQDNYGKTLVIKRAGYEDQGTYTCEVSNGVGTAETYSIQLNVEAAPFFIEEPQFQNLAEGETAVIQCRAGGTPEPTISWVHNGKPIEQAENNPRRQVTANSITITDLVKKDTGNYGCNATNSNGYVYKDVFINVQSIPPEIKEGPDNLTVVAGSTAALRCRVFGAPRPIVRWMRDDIDVTGGKYNITKEGDLVIADVSFTDVGTYQCYAKNKFGEKSAFGSLTVKKRTVITDKPEPYEVAAGSSATFRCNANADDSLKLDIIWLNDGQPIDFDNQPRFRITNDYSLLISDTSELDSGEYTCIAKTAIDEARAQATLTVQDKPNPPALEGIECHERTATLRWHSMGDNRAPILRYSIQYNTSFTPHAWDLASDNVPAIDTSWTVQLSPWANYTFRVTASNKIGASAPSSHSDVCTTQPDVPYKNPDNVEGKGTDPTNMVITWSKMPQIEHNGPGFYYLVSWKRNISGAAWSEEQVRDWATTEYVIPNTPTFKPYKIKVIAVNSKGTSNVAPVEVIGWSGEDTPLQAPTNFTLVQVTTGTTALLSWNPVPAESVRGHFKGYKIQTWIDGEEDRPKEILIKADASSTLVTKFKPFKKNNARILVYNGRFNGPPSETLSFNTPEGKPGTVRSFEAYPIGSSAMLLKWDKPLDENGILTGYKIYYQKVTGTALGPLQERKKEIDPKFDRAKLAGLEPNTKYRIEIRAKTKAGEGDKYYVEQTTRAMVTAVPDVPVFETSTLPAKEGTAHILVRWIPSLEGHAGTHFIAWYRLRGRPDWLQSNEVTEDDYVILTGLEPGLEYEVKVTAHDGDYYSTSDVKVVDTTIDGPHVRPQEATATAGWFIGVMVALAFLLLVLVLVCVARRNRGGKYDVHDRELAHGRADCPDAAFHEYTHPLDNKSRHSMSSGTKPGPESDTDSMAEYGEGETAGMNEDGSFIGQYGRKRRPPPGRFTEDGSFIGQYVPGARVLAPPPPAPAPAAPPTYV-