Monarch geneset OGS2.0

DPOGS200394
Transcript: DPOGS200394-TA (2286 bp)
Protein: DPOGS200394-PA (761 aa)
Genomic position: DPSCF300121 - 306820-310348
RNAseq coverage: 18x (Rank: top 80%)
Annotation
Heliconius: HMEL014620 | E-value 0.0 | 78.10%
Bombyx: BGIBMGA009485-TA | E-value 0.0 | 63.88%
Drosophila: kek5-PB | E-value 2e-90 | 36.35%
EBI UniRef50: UniRef50_UPI00022CA304 | E-value 3e-100 | 35.71% | UPI00022CA304 related cluster n=1 Tax=unknown RepID=UPI00022CA304
NCBI RefSeq: XP_394632.2 | E-value 8e-105 | 35.76% | PREDICTED: similar to kekkon5 CG12199-PA, isoform A [Apis mellifera]
NCBI nr blastp: gi|380020285 | E-value 7e-105 | 37.23% | PREDICTED: uncharacterized protein LOC100863140 [Apis florea]
NCBI nr blastx: gi|380020285 | E-value 3e-107 | 35.52% | PREDICTED: uncharacterized protein LOC100863140 [Apis florea]
Group
KEGG pathway: dme:Dmel_CG4192 | E-value 1e-50
    K07523 (NGL1) maps-> Axon guidance
InterPro domain:
    [265-363] IPR013783 | E-value 8.8e-17 | Immunoglobulin-like fold
    [268-362] IPR013098 | E-value 8.5e-14 | Immunoglobulin I-set
    [279-354] IPR003598 | E-value 3.1e-09 | Immunoglobulin subtype 2
    [216-265] IPR000483 | E-value 3.8e-09 | Cysteine-rich flanking region, C-terminal domain
    [273-365] IPR003599 | E-value 2.3e-07 | Immunoglobulin subtype
Orthology group: MCL25615 (Lepidoptera specific)

Nucleotide sequence:

>DPOGS200394-TA
ATGGCTCACAAATGTTGGAAGACATGGCTAATTATATACACTATATCATTTCTTACAATTAGCACCGCCGATTTTACAACAGAATGTCAAAGACCTTGTGACTGTAGGTGGCAGTCTGGTAATAAAGCTGCAATTTGTTCCAACTCAAGCTTGAGGGTTGTACCAGCTAATTTAAGCAGCGATATACAGATATTAGATCTATCTAACAATAATCTATTACAACTACATCAAGAAGCCTTCAAAAGGGCAGGCCTGAGCAATCTTAAGAAACTTTTTCTAAGAGATTGTAATATTGAAACAATTCACAAGGCAGCGTTTGTGACTCTTGCAATTATGATAGAACTGGATCTATCAAAGAACAGGATCCGATATCTTCACCCGGACACCTTCAAAGGCACGGAAAAACTAAGATTAATAAACTTGAATAACAACTTTATCGATAAACTAGAAGATGGTCTATTTCGCAATTTAAAGTATCTTCAAAAAGTTGAAGTGAGCAACAATAAGATCTTCCGAATTGGAACAAAGGCATTTCTAAACTTGCCTCAACTCAAGATATTGAGAATAGACGGTAACAATTTGAGTCATATGAAACCCGAAACTTTAATGGCGTTACGAAACTTGTCTGGACTTGATTTGCACAACAACCCGTGGCGGTGTGATTGCAACTTACAATCGTTTAGAGATTGGGTTATCACTCACAACTTGTACACTCCACCAACGGTATGTGCGGAACCCGCTTCGATACGCGACAAACTTTGGTACGAATTAGATTCTTCAAACTTTGCTTGTCGGCCTACTATATTAGAACCCCTGCCAGATGCAACTATCAAAAGTTATGAGGAAAACGTGACGCTGATTTGTAAAGTAGTTGGCAATCCACCTCCAGAAGTTGTGTGGAGGTTCAATGGTAAAACAATTGAGATACGATCTTTCGGAGAAATACGTTATAATGTTATGGAGAACACAATGGATCTTATTAGATGGGTTAATTTAACCATACTTCATACCAGGTACAGTGATAGGGGAAATTACACATGTGTAGCTGAAAACCCAGGAGGAAGGGATGAAAAGACTTTAACGCTTATTTTATCTAAATATGGCGCTGCAGGAACCATAGCTGGAATGGACGCAGATTCTTTTGCAATTCTTGTGGGATGTTTATTATCTATCGCCATAATCGTCGGAGCTGTGTTAACCGTCTGCTATTTCACAACACAAAATGGCGAATTAAAAAGACTGATTAAAACTGATAATCGATCATCAAACGGAGAAGCTTTAATAGAAGGCTCTGTGGCGTCCGAAGTAGAAAAAGTGTGTAAAACCGAAGTTAATCCTATGGCCAAGCCAGCGAATAAATACGAATCTACTATGGTCAATACTGCGACAGAAATGTTGGAAATAAAAAAGACATTAATCGACACCGACTCTAGATTGGTTTCCACGGAGAGACATAATCATAAAGAATCAGAATTTCCTAAACATACCAAAGAGTTATTATTGGAGCGCTTACCCCAAGATTCCCAGACTTATCCACCGGATCTTCTTTCATTTCCTCCCCGCCCGAGTCAAATATCACCAGCAAGTGGAATTTCTTTGGACAGGCCACACATCTCAACAAAAATGGAGAGTCCAAAATCATCAAATTGCGGCATGTCGCCTACAAACACGGCAATTTATACGAGACTACCTAATTCGAGCTATCAAGAACCTTTAGTAAGTAAAGGATATGTGACTTTGCCGAGAAGACCTCGATCAGCTTATCCCGAGGATAATCTCCGACCACAAATCTTTTCTACCTTGAATGGTGTCATACCATACTACGATAATTTTAATATGAAGTTTTTTGGTAATGGAGGTAATTATTACAGTCTCAATAAAAGTGAAATCGATTTAGGTCCAGTGAATAAAATCGGTTTTCTAGATGCTTCTGACGATATAGAACCGGCCCCATCACCTGCTCCGGGGACTCCTCATGCCAATATACCGAGAAACAGCTT
GAGTTCTCCAAACATTCATAATCAGTTATTGATGTTACAAGCTATGTCAAATGGAAATCTAAGAAACTCGTTAGAAAATAGGCACTCTAAAGTAACCCTTACAGATAGCGATAGCCTTTTAAAGACTTCAAGTAGAGACTTGAAAATAGGTTACAGTGTAAACGTTGCGAATACACTGAGCAAATCACGGAATATGATGAATCCTCCACCAAAAACAAGGAAAAGACATTCCGGGGAAGTAAAAGAAACATTTCTAAATACTATTGAAAATGCTACACAAGTGTAA

Protein sequence:

>DPOGS200394-PA
MAHKCWKTWLIIYTISFLTISTADFTTECQRPCDCRWQSGNKAAICSNSSLRVVPANLSSDIQILDLSNNNLLQLHQEAFKRAGLSNLKKLFLRDCNIETIHKAAFVTLAIMIELDLSKNRIRYLHPDTFKGTEKLRLINLNNNFIDKLEDGLFRNLKYLQKVEVSNNKIFRIGTKAFLNLPQLKILRIDGNNLSHMKPETLMALRNLSGLDLHNNPWRCDCNLQSFRDWVITHNLYTPPTVCAEPASIRDKLWYELDSSNFACRPTILEPLPDATIKSYEENVTLICKVVGNPPPEVVWRFNGKTIEIRSFGEIRYNVMENTMDLIRWVNLTILHTRYSDRGNYTCVAENPGGRDEKTLTLILSKYGAAGTIAGMDADSFAILVGCLLSIAIIVGAVLTVCYFTTQNGELKRLIKTDNRSSNGEALIEGSVASEVEKVCKTEVNPMAKPANKYESTMVNTATEMLEIKKTLIDTDSRLVSTERHNHKESEFPKHTKELLLERLPQDSQTYPPDLLSFPPRPSQISPASGISLDRPHISTKMESPKSSNCGMSPTNTAIYTRLPNSSYQEPLVSKGYVTLPRRPRSAYPEDNLRPQIFSTLNGVIPYYDNFNMKFFGNGGNYYSLNKSEIDLGPVNKIGFLDASDDIEPAPSPAPGTPHANIPRNSLSSPNIHNQLLMLQAMSNGNLRNSLENRHSKVTLTDSDSLLKTSSRDLKIGYSVNVANTLSKSRNMMNPPPKTRKRHSGEVKETFLNTIENATQV-
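
The lengths listed above (2286 bp transcript, 761 aa protein) can be cross-checked against each other. The sketch below is not part of the original record. It assumes the transcript is a complete coding sequence with no untranslated regions and that the trailing "-" in the protein marks one stop codon. The function name `cds_length_matches` is hypothetical.

```python
def cds_length_matches(transcript_bp: int, protein_aa: int) -> bool:
    """Return True when the transcript length equals the coding length.

    Assumption: the transcript is a bare CDS, so its length should be
    three bases per amino acid plus three bases for one stop codon.
    """
    return transcript_bp == 3 * (protein_aa + 1)


# Values taken from the record: 2286 bp transcript, 761 aa protein.
print(cds_length_matches(2286, 761))
```

For this record the check passes, since 3 x (761 + 1) = 2286.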