Monarch geneset OGS2.0

DPOGS200752
TranscriptDPOGS200752-TA1896 bp
ProteinDPOGS200752-PA631 aa
Genomic positionDPSCF300030 + 584385-600069
RNAseq coverage14x (Rank: top 82%)
Annotation
HeliconiusHMEL0130210.076.84% 
BombyxBGIBMGA004752-TA1e-9238.79% 
DrosophilaCG34114-PB9e-11942.08% 
EBI UniRef50UniRef50_Q0KI851e-11642.08%CG34114, isoform B n=16 Tax=Drosophila RepID=Q0KI85_DROME
NCBI RefSeqXP_001980615.11e-11742.34%GG17847 [Drosophila erecta]
NCBI nr blastpgi|1949021562e-11642.34%GG17847 [Drosophila erecta]
NCBI nr blastxgi|3479666762e-11441.37%AGAP001824-PA [Anopheles gambiae str. PEST]
Group
KEGG pathwaydre:5699312e-15 
 K06781 (IGSF4, NECL2, TSLC1)maps-> Cell adhesion molecules (CAMs)
InterPro domain[68-170] IPR0137834.2e-19Immunoglobulin-like fold
[177-252] IPR0130982.2e-10Immunoglobulin I-set
[185-251] IPR0035987.1e-09Immunoglobulin subtype 2
[78-154] IPR0131621.3e-08CD80-like, immunoglobulin C2-set
[364-451] IPR0089574.8e-08Fibronectin type III domain
Orthology groupMCL15296 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS200752-TA
ATGCCACGCATACGATGGTGGAAAGAGGACAAAGTTATCGCTAAACTGGATCCAGTCGAGGATGAGACACGGCTCAGTCTCCTGGAACTAAGGATTCCCTCTCTAAAGAGGGATCACTTCGAAGCGGTTTACAGTTGTACAGCGGATAATAATGCGATAGTGCCACCTTTACGAGTCAACGTGCAAATACAATTGTACCTGAGGCCATTATCAGTCGAAATCTTGGAAAGAGAGCAACCTTTGTCTGTGGGCCAAGAAACGGATATCGCTTGTAAAGCAGTTGGTGCTAGACCACCTGCAACCATCACTTGGTGGCTTGGTGATAAAAAGATGGTTGTGAATACTCCACAAACGAACTCGGAAGACCGTAATGAGACGATATCGTTCCTACGGTGGACTCCGCGAATGGAAAACGACGGCGGAGTACTCACCTGCAGGGCATCCCATCCAAAACTAGAACACGCCACTTTGGAAACTACTATGACACTTGATTTGCATTATGTTCCCATAGTTGAACTCCAATTAGGTTCAAAATTGAACCCTAATGATATTGAAGAGGGCGATGATGTTTACTTCGAATGCATAGTGCGGGCAAATCCACCGTCTTACAAAGTTGTTTGGGAGCACAATGGTCAGGTAATGACACACAACCAACGTGCTGGAGTAATAGCTGGGTCGGCGCACCTCGCTCTACAGGGAGTGTCGCGGGATCAGGCCGGCCAGTATGTCTGCGTTGCTAGCAACGTGGAAGGCGATGGACGATCACTTCCTGTGTCTCTGCAAGTTATTTATAAACCTATATGCAAGAACACCGTGACAGCAATAGTAGGAGCAGCTATCAATGAAGCAGCTAGAGTCGCTTGTGAAGTCGACGCTTTCCCACTACCGAAGAACTTTCAATGGACACTCAATAACACGTTGGGAACTACTGAGTTAGATGCGGGAAAGTTCACGATAGAAAAATCGGGTCGATCAATTCTCACGTACATACCAACTTCTGACATGGACTATGGGTCGTTGGCCTGTCGAGCTACAAACTTGGCTGGACAACAAATAGAGCCTTGCAGATACACCCTGCTACCGGCCGTCAAGCCTGACCCGCCTGGCAACTGTTCGACCTTGAACTTGACTGATGACTCAGCCGAAATAAAATGTGTTGCTGGCTATGATGGAGGTCTCCAAACTACGTATTTTGTGGAGGCATGGGAAGCGGATGAATTAATAGCAAATGTGTCAACAATACTTCCCGTTTGGAAGTTACAAGGGCTTGGATCTGGGAAGGCATTACAACTCGTGTTTTATGCTAGCAATGCGAGGGGGAGATCTGAAATAACAACGCTTAGAATACACACGTTGTCAAGATTAGCTTTACATACAGAAGCGAAGAATAATTATATGATAATTGACACAAACTGGGCCCTGGGTATTGTAGCGGGTGCTTTGGGTACGCTTGCTGCTGTTGCTGTAATTGCCGTCTTAGCTCGAAGGCGACAGAGGCCTATGGAATATGATGTTCCATTACAAACTATGAAAGCGTCTAAGGGTGCAATGCATAGTTCCAACAATTCCCCTATACAAGACGACAAGAACCCAGACGTAGTACCTCTTGGAAAAGACTTCAACTGTACAAGGGATTCTGAGCTCCCTCCGGAACCGCCGCCTTACGGGATAGCCCTATCGAATATTCAACCGGTGGGTGCGTCCAAAAGCGCGCACAGTCTCCACGATCGAGCGAACACGCCGCATACTCTTAATACGCCTGGATCCGATGGACAGAGCATCAGTACACTGACTGAGGATCGGAGGAGTTTGACGAGTGGAACACTATCGCGGAGGAGAGAGGTCGTCACGACTAGGACCACACTCTTAACCAACGCCAGAGAGAGTTGTGTGTAA

Protein sequence:

>DPOGS200752-PA
MPRIRWWKEDKVIAKLDPVEDETRLSLLELRIPSLKRDHFEAVYSCTADNNAIVPPLRVNVQIQLYLRPLSVEILEREQPLSVGQETDIACKAVGARPPATITWWLGDKKMVVNTPQTNSEDRNETISFLRWTPRMENDGGVLTCRASHPKLEHATLETTMTLDLHYVPIVELQLGSKLNPNDIEEGDDVYFECIVRANPPSYKVVWEHNGQVMTHNQRAGVIAGSAHLALQGVSRDQAGQYVCVASNVEGDGRSLPVSLQVIYKPICKNTVTAIVGAAINEAARVACEVDAFPLPKNFQWTLNNTLGTTELDAGKFTIEKSGRSILTYIPTSDMDYGSLACRATNLAGQQIEPCRYTLLPAVKPDPPGNCSTLNLTDDSAEIKCVAGYDGGLQTTYFVEAWEADELIANVSTILPVWKLQGLGSGKALQLVFYASNARGRSEITTLRIHTLSRLALHTEAKNNYMIIDTNWALGIVAGALGTLAAVAVIAVLARRRQRPMEYDVPLQTMKASKGAMHSSNNSPIQDDKNPDVVPLGKDFNCTRDSELPPEPPPYGIALSNIQPVGASKSAHSLHDRANTPHTLNTPGSDGQSISTLTEDRRSLTSGTLSRRREVVTTRTTLLTNARESCV-