Monarch geneset OGS2.0

DPOGS201985
TranscriptDPOGS201985-TA1377 bp
ProteinDPOGS201985-PA458 aa
Genomic positionDPSCF300060 - 24047-30180
RNAseq coverage429x (Rank: top 28%)
Annotation
HeliconiusHMEL0040402e-6260.43% 
BombyxBGIBMGA010554-TA4e-12458.46% 
DrosophilaCG15630-PA3e-3126.12% 
EBI UniRef50UniRef50_B0W5V25e-3727.49%Putative uncharacterized protein n=3 Tax=Culicidae RepID=B0W5V2_CULQU
NCBI RefSeqXP_001844086.11e-3727.49%conserved hypothetical protein [Culex quinquefasciatus]
NCBI nr blastpgi|1700324332e-3627.49%conserved hypothetical protein [Culex quinquefasciatus]
NCBI nr blastxgi|1700324337e-3727.49%conserved hypothetical protein [Culex quinquefasciatus]
Group
KEGG pathwaybfo:BRAFLDRAFT_1224602e-17 
 K06491 (NCAM)maps-> Cell adhesion molecules (CAMs)
    Prion diseases
InterPro domain[123-213] IPR0137833.2e-17Immunoglobulin-like fold
[122-196] IPR0130981.5e-14Immunoglobulin I-set
[123-210] IPR0035996.6e-10Immunoglobulin subtype
[129-197] IPR0035988.3e-10Immunoglobulin subtype 2
[306-451] IPR0089574.2e-08Fibronectin type III domain
Orthology groupMCL18361 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS201985-TA
ATGCCATATGCCATTATATTATCTCTGACAGCTGAAACGGAAGATGGAGCACCATTCTTGCACGTAATGGCCAAATCATCGGTTTTCAATGTCGGTGAAAAGAAAGCAATCTACTGCAAGGGGCTTAATTTACCAGAGAGGATAGACTGGGTGTCACCATCTGACGAAGTAGTGGAAATAAGATCATCCAGGAACACCAGGGTGTATGTTGAAAGGCACAAGAACGACACCTTATCCAGCAGCCTGGTACCGCTCATTTTCCATAGCATCAAGATTAAGGATAGCGGCAATTGGACCTGCAAAGCTGGAAACCTCAATGAAACCATCGAAATTCTTGTCGGTGAAAAAGTAAGTCTTAGTGAGAGGCAAGAGACTCTGGAGGGTGAAGAGACAAAATCTGTCAAATTGGATTGCGTTGCAAAGGGATATCCAGCACCGGTCGTGCAATGGTATAAGGATTCCGTTTCGATTGTCGACGACCACAAGAAGTTTATCGTGAGGAAAAAAGATAACAACTATCAGTTGGAAATCAAAAACTTGACACATCAAGATACAGGCGAATATATGTGCAAAGTAACGCAAAAGGCACTCTCATACTATACGTACAAACGTGTCCTGCTGTCTGTCCAACATAAACCTATTCTGATTAATGAGGCCACCAATGAGGTTTACTACACTAAATATAGAACTGAGGAAGTTTACGCCATCTTGAATGAAACGAAAAATATTACATGCAGCGCTCTAGCTTACCCACCACCAACATACACCTGGTCCAGGAGGAGAAACAACTTCGACGATGACCCCATTAATGAAGAGGATACGATTCATTCAGCGGACGGTACGAGCTCTGTGCTAGTATTGCGAATGTACAATGAGAGTAATCTCGGAGAATACAAATGTGCTGCGAAGAATGATAAGGGACACGTCTCAGTTGTATTTCACGTGTCCTTGGGCAATAAACCGAATCCACCCGACTTTCTCACCTTGGCGTCTCGTACAAAGACAGAACTAACATTCAACGTATCTTGCTCAACTTGCAATATGGCGATTGAAGAAGACGATAAATCTCAGGATCCCGAGAATCTAACTGTGCTTGGATATTCTTTCCAACTTGTTCCTGCACAGGAAGGATATTTGCCTGATTGGGCAGCAGCCACAGAATTTGAAGTTAACTTTGAATATTACAATGAATCATTGTTCACTGTGGGACCATTGCAAAATCAAACTACGTACCACGTGAGGGTTCGCACACGCAACGCAGCCGGCTATTCTGAATGGGTTGAACTCTCAACAACTGTGACTACAGACTTCGCCGTCAAACTCACCGCCTCCATTATTTTAATGTTTGTTACACTGTTTATAACGCGTTGTTATTAA

Protein sequence:

>DPOGS201985-PA
MPYAIILSLTAETEDGAPFLHVMAKSSVFNVGEKKAIYCKGLNLPERIDWVSPSDEVVEIRSSRNTRVYVERHKNDTLSSSLVPLIFHSIKIKDSGNWTCKAGNLNETIEILVGEKVSLSERQETLEGEETKSVKLDCVAKGYPAPVVQWYKDSVSIVDDHKKFIVRKKDNNYQLEIKNLTHQDTGEYMCKVTQKALSYYTYKRVLLSVQHKPILINEATNEVYYTKYRTEEVYAILNETKNITCSALAYPPPTYTWSRRRNNFDDDPINEEDTIHSADGTSSVLVLRMYNESNLGEYKCAAKNDKGHVSVVFHVSLGNKPNPPDFLTLASRTKTELTFNVSCSTCNMAIEEDDKSQDPENLTVLGYSFQLVPAQEGYLPDWAAATEFEVNFEYYNESLFTVGPLQNQTTYHVRVRTRNAAGYSEWVELSTTVTTDFAVKLTASIILMFVTLFITRCY-