Monarch geneset OGS2.0

DPOGS209476
TranscriptDPOGS209476-TA1932 bp
ProteinDPOGS209476-PA643 aa
Genomic positionDPSCF300275 + 273752-276658
RNAseq coverage489x (Rank: top 25%)
Annotation
HeliconiusHMEL0044910.073.07% 
BombyxBGIBMGA005854-TA0.060.32% 
DrosophilaCG16857-PA5e-13942.68% 
EBI UniRef50UniRef50_Q9U4G17e-13742.68%BcDNA.GH11322 n=21 Tax=Endopterygota RepID=Q9U4G1_DROME
NCBI RefSeqXP_001603297.14e-14143.40%PREDICTED: similar to GA14181-PA [Nasonia vitripennis]
NCBI nr blastpgi|910814275e-13841.78%PREDICTED: similar to neuronal cell adhesion molecule [Tribolium castaneum]
NCBI nr blastxgi|3320309725e-13843.59%Protein turtle [Acromyrmex echinatior]
Group
Gene OntologyGO:00055154.4e-11protein binding
KEGG pathwayecb:1000545752e-26 
 K06757 (NFASC)maps-> Cell adhesion molecules (CAMs)
InterPro domain[368-461] IPR0089577.7e-22Fibronectin type III domain
[270-362] IPR0137831.7e-16Immunoglobulin-like fold
[289-355] IPR0035984.8e-13Immunoglobulin subtype 2
[372-452] IPR0039614.4e-11Fibronectin, type III
[283-366] IPR0035992e-10Immunoglobulin subtype
[89-177] IPR0130982.3e-09Immunoglobulin I-set
Orthology groupMCL14645 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS209476-TA
ATGAGTTTAAGTCAAAGTGAGATCAAGGGCGAGGTAATATTTTCGTGGTACAGCAGTGAGGGCAGGGCGCAGCTGGCGGACCGCTGGGGCGGACGCGTGAGGAGGGTCTTTTCCCCAGGGCTCGGCCGCGGCTCCGTCAACATCAGCTCCGTGAGGGAGACGGACGCGGGCCTCTACCGCTGTCGCGTCACCTTCCCCAACCGCACGCCGCCGGCTCGTAACAACGGGACCTTCTATTACCTCGACGTCGACGGCGGCAACCTCATCGTCACTCCGCCCGTCAACGTCACAGTCCTCGAGGGTGATCGCGCCGAGTTGGAATGTCTGCCCAAGAGCCCGGAGTGGACGGTGCAGTGGTACCACGAGGGGGTGCCGGTGGAGACGTTGCCCGAGTTTGCGCAGCGGTCGCAGTTGGCGGTTAACGGCAGTCTAATAATGAAACAAGTCGTCAGCACTGACCTCGGCGAGTACGAGTGTCGTATCAGTGCTGCTGACGGACAGACTCAGAGCGCTAGTGCCTTTTTAGATGTTCAATACAAGGCGAAAGTTGTCTATTCTCCCAAAGAAAGGTATTTACCATATGGCAAATCGGCGAGCCTCGATTGTCATTTCAGTGCGAATCCTCCTTTAACAAACCTACGATGGGAAAAAGATGGTTTCCTATTCGATCCCTACAACGTACCCGGCGTATTTTATAGTATAAACGGTAGTCTGCTCTTTAATCAGGTAGATGAATCCCACGAGGGAATGTACTCATGTACGCCTTACAACGTGCTGGGGTCGGCGGGTCCGTCAGCCGAGGTCCGTGTGCGCGTGGCTCGTCCTCCGGCGTTGGTCGTGCGACCTCTGCCTTTGTACTTGGTGAGGCTCGGAGCTACTGTCACGTTGCCTTGTGCCGTCGCCCGCGAGCCGCATCACGCACCACCAGCCATTCACTGGATAAAGAAAGATGGAACTCCTCTGCCTGAAGGACGTTATTCGCTAAGCGAAGGCAATTTGACCATCACTCAAGTATCGGAAGAAGACCGCGGTGTCTATGTGTGCTCTCTGAGTAACGAGGCCGACGAGCTTGCCGTTGAAACTGAGTTGTTGTTAGAAAACGTACCTCCGCGTGCGCCCTACAACCTCACGGCCAAGTCCACTGCCAACTCAATTCACCTGTCGTGGGTTCCAGGACACAATGGCATGGACGTCGAGTACAACGTGTGGTACCGCGAGCGAGCGGACAGCGAGTGGCGCACAATGAAACTCTTGTCGCGAGGCTCGACTCATGCTACGTTGCTAGCGCTTCGCCCCGCCACCGAGTACGAACTGCGAGTACTCTCACAGGATCACATCGGCGACGGTTTATTTAGTAAACCGATATTCGTGCGGACCCTAGATTCTGATCCCGGTGAAGATAGAAGTACTACAGCCGGTGTTTCGGCCACTGTAACAGAAGGCATTATGAACACGGGCACCTCTTGGAGTAGTGCAACCAACGAGGCGAGCGGAGCGACGCTCGGCTCGGTGAGTGGTTCGGTCGGAGGTCCGATCGGAGGGTCGGGCACCTTGGACGAGCTCGGGGTTACCGTCCGGCTCATCGAGGAAGGAGCGCTGATCCGATGGAGTCGCGCGCCTGGAGACGACACACTCTGCTCAGTGCGTTGGTACGACACCGACGAACCGCAACATAAACTTATAGCTGTCTTTTCGACGCATCAAGATTATATTCTCGTGAGTCCAGTAGTGGAGGGTGCGGGTTACTGGGCCGGCGTGGAGTGCGGCGAAGTCATCGGAGGCACGGGTGTGCAGGTCCCGCAGTACACCCGCCTGCGGGGAGTAGCGGCAGGGTCCGCGGCTGCGGCGCTCCTGCTGGGAGCCCTGGCCGCCGCCCTGTGCCTGGCCCGCCGCCGCCTGCGACCTCACGCCCGCGACAAACGCTCCCGCTAA

Protein sequence:

>DPOGS209476-PA
MSLSQSEIKGEVIFSWYSSEGRAQLADRWGGRVRRVFSPGLGRGSVNISSVRETDAGLYRCRVTFPNRTPPARNNGTFYYLDVDGGNLIVTPPVNVTVLEGDRAELECLPKSPEWTVQWYHEGVPVETLPEFAQRSQLAVNGSLIMKQVVSTDLGEYECRISAADGQTQSASAFLDVQYKAKVVYSPKERYLPYGKSASLDCHFSANPPLTNLRWEKDGFLFDPYNVPGVFYSINGSLLFNQVDESHEGMYSCTPYNVLGSAGPSAEVRVRVARPPALVVRPLPLYLVRLGATVTLPCAVAREPHHAPPAIHWIKKDGTPLPEGRYSLSEGNLTITQVSEEDRGVYVCSLSNEADELAVETELLLENVPPRAPYNLTAKSTANSIHLSWVPGHNGMDVEYNVWYRERADSEWRTMKLLSRGSTHATLLALRPATEYELRVLSQDHIGDGLFSKPIFVRTLDSDPGEDRSTTAGVSATVTEGIMNTGTSWSSATNEASGATLGSVSGSVGGPIGGSGTLDELGVTVRLIEEGALIRWSRAPGDDTLCSVRWYDTDEPQHKLIAVFSTHQDYILVSPVVEGAGYWAGVECGEVIGGTGVQVPQYTRLRGVAAGSAAAALLLGALAAALCLARRRLRPHARDKRSR-