Monarch geneset OGS2.0

DPOGS208940
TranscriptDPOGS208940-TA2016 bp
ProteinDPOGS208940-PA671 aa
Genomic positionDPSCF300009 + 179109-181492
RNAseq coverage672x (Rank: top 19%)
Annotation
HeliconiusHMEL0080080.090.47% 
BombyxBGIBMGA002413-TA0.079.94% 
DrosophilaCont-PA0.060.32% 
EBI UniRef50UniRef50_B2DBK60.084.51%Contactin n=2 Tax=Endopterygota RepID=B2DBK6_9NEOP
NCBI RefSeqXP_395773.30.066.72%PREDICTED: similar to Contactin CG1084-PA [Apis mellifera]
NCBI nr blastpgi|1839792920.084.51%contactin [Papilio xuthus]
NCBI nr blastxgi|1839792920.084.51%contactin [Papilio xuthus]
Group
Gene OntologyGO:00055155.5e-12protein binding
KEGG pathway 
InterPro domain[25-119] IPR0137832.2e-22Immunoglobulin-like fold
[217-321] IPR0089573.5e-22Fibronectin type III domain
[40-104] IPR0035981.4e-14Immunoglobulin subtype 2
[34-115] IPR0035991.7e-13Immunoglobulin subtype
[35-114] IPR0130982.2e-13Immunoglobulin I-set
[218-311] IPR0039615.5e-12Fibronectin, type III
Orthology groupMCL10256 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS208940-TA
ATGTACCAATGTGAGGCCAAGAACCAGCTAAGAACAAAGTATTCCACTGGCCAGTTACGTGTGCTGTCATTGAAGCCATCATTTAAGAAGCATCCATTAGAGTCCGAAACATATGGATCAGAAGGTGGGAATGTGACTCTCAGGTGTAACCCTGAAGCTGCGCCAAAGCCTACATTCACGTGGAAGAAAGATAACATTGTCATAGGTGCCGGAGGGAAGAGGTTTATTACGGAGAATGGAAATCTTATTATAAGACAACTATCTAGAGACGATGAAGGTGTTTACACTTGTGTTGCCAAGAATCAGTACGGCACGGACGAGAGTCGGGGACGTCTTATAGTTCTTAGAGCCCCTCGTTTCATTGAACGATTACCTCCAAGAATAACAACACAGGTAGGACAAGTCATTTTCCTTCATTGTAATGCTGAAATAGATTCTATGCTCGACACAGCTTACTTGTGGAATCACAATGGTATCAGGATGAAGGAAGCTGCTGATCTGTACGCAGATAAACGAATTAAAATCGACGGTGGTGAACTTACGATATTCGACGTGAGTTTGTCGGATGTTGGAGAATACGAGTGCATTGTAAAATCAGCCATAGGTCGTATTTCGACTCGCACACAATTACGTATAGAAGGGCCGCCTGGTCCACCTGGTGGTGTACAGGTTTCAAATATTCAACAGTCTTCTGTTACCCTGGAATGGACCGACAGCAATCCTAATGGACGTCCTATAACTAATTATGTTGTCACGGGACGAACACAATGGAATTCAACTTGGTTTGTGTTGAGCGAGAGTGTCACAAATGTATTTGAAATAGACAGATACAATGGACGCAAACGAGCCACCGTAACCACAACCTTGATGCCTTGGTCTGTGTATGAGTTCAGGGTCCAAGCTGTCAATAGTTTAGGTATAGGGGAACCGTCTTCTCCTAGTCCTCAATTCTCCACACAAGCCGATAAGCCCTATCACAGTCCTTTCAACATTGGAGGTGGTGGTGGAAAAATTGGAGACCTTACCATTAAGTGGACACCGCTACCAAGATCCTTGCAAAACGGTCCAGGCATTTACTATAAGATATTTTGGCGTCGGAATAACACAGAAGTCGAATTTCAGTCACTTTCACTTAAAGAATATGGGAACATAGGAATGCACGTTGTACACGTACCTTTGGATTACTTCTTCACTCCTTACAATATAAAAGTGCAGGCGTTCAATGACATTGGCGCGGGTCCCGAGAGTGAGGTAGTCACTATATATTCTGCTGAAGATATGCCTCAAGTCGCCCCTCAGCAAGTATATTCAAGATCATTCAACTCTACCTCTCTTAATGTAACGTGGAACCCTATCGACCAGTCACGTGATAGACTACGAGGGAAATTAATCGGTCACCGTCTTAAGTACTGGAAGCAAGCTAACAAGGAAGAGGAATGCATCTACTATTTATCAAGAACAACACGCAATTGGGCTTTGATTGTAGGGCTCCAGCCAGACACATACTATTATGTCAAAGTTATGGCGTTTAATTCAGCTGGTGAAGGTCCGGAAAGCGAGCGATACTTGGAGCGTACATACAGGAAGGCTCCTCAGAAGCCCCCTGCTTCTGTCAACGTGTGGGCTGTCAACCCCAGCACCGTGCGAGTTGTTTGGCGATATGTCCAGCCCACTGACGAAGAGGAACCTCTCATAGGCTATAAGGTGAGAGTGTGGGAGATAGATCAAGATATGAGCACGGCTAATGACACTTTAGTGCCGGTGGAACATAAATTGGAAGCGTATGTCAATAGTTTGACTCCAGGGAAGTCTTATAACCTGAGAGTCTTGGCTTACTCCAATGGTGGGGACGGCAGGATGTCCTCTCCCCCAATAACATTCCAAATGGGCGACTACCACAGTGATGCTTACGGATACTATGTTAGAAGTGCAGCTCATAAACAAAATGTCTCTCACATATACTCTGTAGAAATGTTTCTATTTTATATATTTTTCTTAACTTGTTACAGTCTGTAG

Protein sequence:

>DPOGS208940-PA
MYQCEAKNQLRTKYSTGQLRVLSLKPSFKKHPLESETYGSEGGNVTLRCNPEAAPKPTFTWKKDNIVIGAGGKRFITENGNLIIRQLSRDDEGVYTCVAKNQYGTDESRGRLIVLRAPRFIERLPPRITTQVGQVIFLHCNAEIDSMLDTAYLWNHNGIRMKEAADLYADKRIKIDGGELTIFDVSLSDVGEYECIVKSAIGRISTRTQLRIEGPPGPPGGVQVSNIQQSSVTLEWTDSNPNGRPITNYVVTGRTQWNSTWFVLSESVTNVFEIDRYNGRKRATVTTTLMPWSVYEFRVQAVNSLGIGEPSSPSPQFSTQADKPYHSPFNIGGGGGKIGDLTIKWTPLPRSLQNGPGIYYKIFWRRNNTEVEFQSLSLKEYGNIGMHVVHVPLDYFFTPYNIKVQAFNDIGAGPESEVVTIYSAEDMPQVAPQQVYSRSFNSTSLNVTWNPIDQSRDRLRGKLIGHRLKYWKQANKEEECIYYLSRTTRNWALIVGLQPDTYYYVKVMAFNSAGEGPESERYLERTYRKAPQKPPASVNVWAVNPSTVRVVWRYVQPTDEEEPLIGYKVRVWEIDQDMSTANDTLVPVEHKLEAYVNSLTPGKSYNLRVLAYSNGGDGRMSSPPITFQMGDYHSDAYGYYVRSAAHKQNVSHIYSVEMFLFYIFFLTCYSL-