Monarch geneset OGS2.0

DPOGS215849
TranscriptDPOGS215849-TA1386 bp
ProteinDPOGS215849-PA461 aa
Genomic positionDPSCF300073 + 742236-743875
RNAseq coverage39x (Rank: top 73%)
Annotation
HeliconiusHMEL0074620.082.86% 
BombyxBGIBMGA002949-TA0.074.42% 
DrosophilaCadN-PL3e-10646.21% 
EBI UniRef50UniRef50_B0X8X48e-10848.76%Predicted protein n=1 Tax=Culex quinquefasciatus RepID=B0X8X4_CULQU
NCBI RefSeqXP_001866096.11e-10848.76%predicted protein [Culex quinquefasciatus]
NCBI nr blastpgi|1700611133e-10748.76%predicted protein [Culex quinquefasciatus]
NCBI nr blastxgi|1700611131e-10548.44%predicted protein [Culex quinquefasciatus]
Group
Gene OntologyGO:00160201.6e-12membrane
GO:00071561.6e-12homophilic cell adhesion
GO:00055091.6e-12calcium ion binding
KEGG pathway 
InterPro domain[115-227] IPR0021261.6e-12Cadherin
[114-223] IPR0159191e-09Cadherin-like
Orthology groupMCL22717 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS215849-TA
ATGGCGCACCAAGTGGTGGTGGGAGGAGCTTTGCTCTTCTTATTATGTTTTGCTCTGAATACAAGTTCACTCATCCTCTTGCCGCACGATGTCCGCCCTGGCCACGCTGTTCGACATTTTATTGGTAACCATTCGCGTTACACGCTACTTGATCCGGAATATGAACCTTTCTTTACATTACTCGACGATGGATTGCTTATGACAACAGCTGATCTCGGACCACTGTTGAATCAACCTCTAAATCTCGCCATCTTAGAACAAACACCATTTAGCAGCTCGGCCCATACAATTCATCTATTAGTTATGGATCGCAGAAAAATGCTCCATTTTTCTGATGTTAATGCCGTTCATGGTGAAATTCCTGAAAATGCTCCACCAGGGTCAGTTGTAGATTGTTCTCCTATAAAAGCCACTGCTTTAGTAAATGTTGGACCCATTGCTTATAAGATAACAAAAGGTAATGATCAATCTGTCTTCGCCCTTCGTGAAAAATCGCGGCCTAGTAATTCCGATATCAAATCGGTAATAACAGATGGAGACGTAGAAATTGTTGCTATGAAACCATTGGACAGTGAAACTAAGAACCTTTACGATTTAGTAATTCAAGCGACTGATTTACACGGAGCAAATAAAGCTAGCCTTCCGGTTCGCGTCAACGTTATTAATGAGAACGATCATGAACCGATTTTTGAACAAGAAATATATTATTTCGCTGTTAATGGCACCTGTGATGATAATTGTCACAACGGCACGGCTTATTGGCAACGCTTTTCTACTATTGGAAAAGTTCGCGCTTTTGACGCCGATGGTGATAGAGTATACTATTCACTAAAAGCCCCATCGAATCTTGCGGTAATTGTACCCCAAACTGGAGAATTAATACTGGCTGGTGAACCGGATGGACATGAGGCAGAGCTGGAAGTTTTGGCTCATGACGTAGGAATCCCGCCACGGAAAAGCCAACCGGCGCAAGTTTTCATTGAATTTGTAATACGCGAACGAAAAGACATGCCCACACTGCATCGTGAAAAGAGAAGAGTCACTCGAGCGGTTCGTCCTACTAAACGTATAGAATTCACCGAGGCCGATGGCGAAGTTGAAGGGCGTGCGGTGTTTACTCTCGAAAAAGAAACCGACCGTGAGACATTTAAAATAAGGGACGAAAATCCTTGGGTTACCGTTGAACCTAGTGGTGTAGTGAAAGTTAAAAAGAAGTGGGACTATGAGGAGTTGGGACCTGAGAAGACTATTGATTTCTGGGTCACTATCACGAACGCTGGGAATGGAGTCAACCTGCTCACACCTGTCGTATCGAATGGTGCAATGGCGCATGACATTTACAGACAGGTCTCGTCTTTGATGGCGTTCCCATCACGACCGATGTAA

Protein sequence:

>DPOGS215849-PA
MAHQVVVGGALLFLLCFALNTSSLILLPHDVRPGHAVRHFIGNHSRYTLLDPEYEPFFTLLDDGLLMTTADLGPLLNQPLNLAILEQTPFSSSAHTIHLLVMDRRKMLHFSDVNAVHGEIPENAPPGSVVDCSPIKATALVNVGPIAYKITKGNDQSVFALREKSRPSNSDIKSVITDGDVEIVAMKPLDSETKNLYDLVIQATDLHGANKASLPVRVNVINENDHEPIFEQEIYYFAVNGTCDDNCHNGTAYWQRFSTIGKVRAFDADGDRVYYSLKAPSNLAVIVPQTGELILAGEPDGHEAELEVLAHDVGIPPRKSQPAQVFIEFVIRERKDMPTLHREKRRVTRAVRPTKRIEFTEADGEVEGRAVFTLEKETDRETFKIRDENPWVTVEPSGVVKVKKKWDYEELGPEKTIDFWVTITNAGNGVNLLTPVVSNGAMAHDIYRQVSSLMAFPSRPM-