Monarch geneset OGS2.0

DPOGS203319
TranscriptDPOGS203319-TA921 bp
ProteinDPOGS203319-PA306 aa
Genomic positionDPSCF300003 - 867178-869046
RNAseq coverage224x (Rank: top 44%)
Annotation
HeliconiusHMEL0088123e-15686.27% 
BombyxBGIBMGA002079-TA2e-9979.74% 
DrosophilaEph-PC2e-2732.71% 
EBI UniRef50UniRef50_Q6VU509e-10281.58%Eph receptor n=7 Tax=Neoptera RepID=Q6VU50_MANSE
NCBI RefSeqXP_966603.23e-5048.91%PREDICTED: similar to ephrin receptor isoform 1 [Tribolium castaneum]
NCBI nr blastpgi|343982733e-10181.58%Eph receptor [Manduca sexta]
NCBI nr blastxgi|343982734e-10681.58%Eph receptor [Manduca sexta]
Group
Gene OntologyGO:00055151.4e-18protein binding
KEGG pathwaydme:Dmel_CG15112e-25 
 K05110 (EPHB1, ELK, NET)maps-> Axon guidance
InterPro domain[3-186] IPR0206942.7e-30Tyrosine-protein kinase, ephrin receptor Dek-like
[229-297] IPR0137611.9e-20Sterile alpha motif-type
[232-293] IPR0211298.7e-19Sterile alpha motif, type 1
[226-303] IPR0109931.4e-18Sterile alpha motif homology
[228-295] IPR0016602e-17Sterile alpha motif domain
[24-143] IPR0089571.2e-16Fibronectin type III domain
[31-139] IPR0137832.2e-16Immunoglobulin-like fold
[44-131] IPR0039611.5e-13Fibronectin, type III
Orthology groupMCL10101 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS203319-TA
ATGGGATTGGAACCAGTAACAGAATATAAATTTCAAGTATTCGCTTTAAATGGTGTATCTGACTTAACTGGTGAATCACCAAAAAAGGTAGAAATAACAGCAGTTACTGAAGCATCTGTTGTTAGTGTAATAACTAAACTGAGAGTAGTAAGTGTTGAAAGTGACAAGTTATCTTTAGCTTGGAACCCGCCACCCATTGATTTAACGGACCCTGATGATAGTATAGAAAGCTATGAAGTCAAATGTTTCCCTAAAGATCATATGGAAAAAAGTGCAAATGCTACTGTTAGGATTACTAAGGAACCACATGTTATAATAACAGGTCTGAAAAGAGACACCGAATATGGCATAAGAGTTAGGGCAAAAATGAAAAAAGGCTGGGGTGAATTGAGTGGCATTGTTTATGCTCGTACTGGATCTGTACTTGAAACATCATTTGTTGGAGAAGAAGAAGGTGCTCAAGTTCGCTTGGTTGCTGGTGTTATGGTAGCCGTTGTTGTACTTTCAGTTATTGCAATTATTGCAACTGTTCTATTCCTACGGTCACGATCTGATGATGAATGTGATAAAAAGCAACCAAGTGATTGTAATGCATTGAATTATCGTAATGGAGAAGTTTATACTGGTCCTGACAGACCTGCAAAGACAGGAAGCAATGCAACAACACCTTTATTTGCAGGAACTGATATGATTCAGTTTTCTTCGGTTGAAGAATGGTTGGAATGTATCAAAATGTCTAGATACATTGAAAAATTCAGAACAGCTGGCATCACAGACATGAATGCCGTTGTGGATCTCACAGTTCATCAACTTGCATCCTTAGGTGTGACATTGGTCGGGCATCAAAAGAAGATCATGAACAGTGTTCAAAGTATGCGTGCTCAAATACGCGTCAGTGGTCCTTATGGTTTTCTGGTGTAA

Protein sequence:

>DPOGS203319-PA
MGLEPVTEYKFQVFALNGVSDLTGESPKKVEITAVTEASVVSVITKLRVVSVESDKLSLAWNPPPIDLTDPDDSIESYEVKCFPKDHMEKSANATVRITKEPHVIITGLKRDTEYGIRVRAKMKKGWGELSGIVYARTGSVLETSFVGEEEGAQVRLVAGVMVAVVVLSVIAIIATVLFLRSRSDDECDKKQPSDCNALNYRNGEVYTGPDRPAKTGSNATTPLFAGTDMIQFSSVEEWLECIKMSRYIEKFRTAGITDMNAVVDLTVHQLASLGVTLVGHQKKIMNSVQSMRAQIRVSGPYGFLV-