Monarch geneset OGS2.0

DPOGS206619
Transcript         DPOGS206619-TA   1998 bp
Protein            DPOGS206619-PA   665 aa
Genomic position   DPSCF300048 - 850385-880018
RNAseq coverage    104x (Rank: top 60%)
Annotation
Heliconius       HMEL012023         0.0      70.04%
Bombyx           BGIBMGA000941-TA   1e-138   77.62%
Drosophila       Sema-2a-PB         0.0      61.17%
EBI UniRef50     UniRef50_Q24323    0.0      61.17%   Semaphorin-2A n=55 Tax=Pancrustacea RepID=SEM2A_DROME
NCBI RefSeq      XP_968340.1        0.0      68.06%   PREDICTED: similar to semaphorin 2a [Tribolium castaneum]
NCBI nr blastp   gi|91077764        0.0      68.06%   PREDICTED: similar to semaphorin 2a [Tribolium castaneum]
NCBI nr blastx   gi|91077764        0.0      67.90%   PREDICTED: similar to semaphorin 2a [Tribolium castaneum]
Group
Gene Ontology     GO:0005515               2.9e-169   protein binding
                  GO:0016020               2e-06      membrane
                  GO:0007275               2e-06      multicellular organismal development
                  GO:0004872               2e-06      receptor activity
KEGG pathway      bfo:BRAFLDRAFT_279857    2e-93
                  K06842 (SEMA6)           maps-> Axon guidance
InterPro domain   [26-464] IPR001627       2.9e-169   Semaphorin/CD100 antigen
                  [13-481] IPR015943       1.4e-152   WD40/YVTN repeat-like-containing domain
                  [482-530] IPR016201      2e-06      Plexin-like fold
Orthology group   MCL10354                 Insect specific

Nucleotide sequence:

>DPOGS206619-TA
ATGCTCGAAGACGCGACAGTGCGAGTGAAATATGCAAAAGATCATGTACGGGAATTCTCTTGTGGCACTCTATACTACAGAACCTTCTTTGTGGATTCCAAACGCGACGCTCTCTACGTCGGCGCTATGGATCGACTCTACAGACTTAACATGAACAATATAAGCGCTTCGCATTGCGATAGAGATTCTATTAATTTGGAACCAAGCAACGTAGCTCAATGTGTCTCTAAAGGGAAATCAGAACACTTCGATTGCCGCAACCACGTGAGAGTGATACAACCTATGGGTGATGGTAGTCGCCTTTACGTGTGCGGAACCAACGCCCATAGCCCTAAAGACTGGGTATTATATTCTAACCTGACACACCTGCCACGCTACGAGTACGTGCCGGGCATCGGCATGGGTGTTGCAAAATGTCCTTACGATCCTGCTGACAACTCAACCGCCTTGTGGGTGGAAAAAGGCAACCCTGGTTCTCTTCCTGCTCTGTACTCCGGAACCAACGCAGAGTTTACTAAAGCGGACACTGTGATCTTTAGAACCGACTTGCACAACTTGACAACGGGACGACTGGAGTTTTCTTTTAAACGTACTTTGAAATACGACTCTAAATGGCTTGACAAGCCTAACTTCGTTGGATCTTTCGACGTTGGGGAATATGTGCTGTTCTTCTTCCGCGAAACAGCTGTGGAATATATTAATTGTGGAAAAGCAGTGTACTCTAGAGTAGCGAGAATTTGCAAGAAAGACACTGGCGGCAAAAACATCTTAAGTCAAAACTGGGCTACTTACTTGAAGGCGCGCCTAAACTGTAGTATACCCGGGGAATTCCCATTCTACTTTAACGAGATCCAGAGCATCTACAAAGTTCCTGGGGATGACAGCCGTTTCTACGGTGTATTCACAACTGCTTCAACAGGATTGATGGGTTCAGCAATTTGCACATTCACCATCGCTGACATTCAGAAAGCATTTGAAGGAAAATTTAAAGAACAAGCAACTTCAAGCTCCGCCTGGCTTCCAGTCATCAGCAGCCGAGTACCTGAACCCAGACCTGGCAAATGCGTGAATGATACTTCTTCATTGCCAGATACAGTGCTCAATTTTATAAGGTCACATCCACTAATGGACAGCGCGGTATCTCATGAACACGAAAAGCCTATTTACTATAAAAGAGATCTTTTCTTCACTCGTCTGGTTGTGGATCGAGTAAAAGTGGACATGATGGGACACCCTCTTGAATACACCGTTTACTATGCGGGGACGAATGAAGGTAAAATTCATAAAATCGTTCAATGGTCTCGGAACGGGGACAGTCAGTCAGCTTTACTTGACATCTTCGATGTGACTCCCGGCGAACCAATTCAGGCTATGGCACTTTCGCGTATACACGGATCATTATACGCGGCGTCAGATCGGCGCGTGCTCCAGCTACGACTTGCTCTCTGTGCGCGACGTTACGACGCTTGTGTTCGCTGCGCTAGAGATCCGTACTGCGGTTGGGATAGAGAAGCGGGAGTATGCAGGGAATACATGCCCGGCTTGATACAAGATGTCGCGAACGAAATGACTTCTATGGACAATTCTTATGGATTTGATAAGTACCCTGATATTTTTTTACTTGCTATTTTCAATGTTACGATCCTGATACAAGATGTCGCGAACGAAACAGCAGACATTTGTGATACTTCGATCTCAAGGAAAGTGGTATCAGCGACGTGGGGACAGAGCTTGCATCTGGGCAGCTTCGTTAAGATGCCAGAGGTTCTACACCCGCGAGCCATCTCGTGGTATCATTACACAAGAGATGGAAGACATGCTGTTAGTTTCAAATGTTCTCCGCCCGAGAAGAGCAATGAGTACCAGAAGATATATTCAAACTGGTGTCACGAATTTGAAAAATATAAGACGGCTATGAAAACATGGGAAAGAAAGCAAGAGCAATGTGCACGTCAAAATGAATCCAATCAGAACACGCATCCCAACGAGATCGTGTGA

Protein sequence:

>DPOGS206619-PA
MLEDATVRVKYAKDHVREFSCGTLYYRTFFVDSKRDALYVGAMDRLYRLNMNNISASHCDRDSINLEPSNVAQCVSKGKSEHFDCRNHVRVIQPMGDGSRLYVCGTNAHSPKDWVLYSNLTHLPRYEYVPGIGMGVAKCPYDPADNSTALWVEKGNPGSLPALYSGTNAEFTKADTVIFRTDLHNLTTGRLEFSFKRTLKYDSKWLDKPNFVGSFDVGEYVLFFFRETAVEYINCGKAVYSRVARICKKDTGGKNILSQNWATYLKARLNCSIPGEFPFYFNEIQSIYKVPGDDSRFYGVFTTASTGLMGSAICTFTIADIQKAFEGKFKEQATSSSAWLPVISSRVPEPRPGKCVNDTSSLPDTVLNFIRSHPLMDSAVSHEHEKPIYYKRDLFFTRLVVDRVKVDMMGHPLEYTVYYAGTNEGKIHKIVQWSRNGDSQSALLDIFDVTPGEPIQAMALSRIHGSLYAASDRRVLQLRLALCARRYDACVRCARDPYCGWDREAGVCREYMPGLIQDVANEMTSMDNSYGFDKYPDIFLLAIFNVTILIQDVANETADICDTSISRKVVSATWGQSLHLGSFVKMPEVLHPRAISWYHYTRDGRHAVSFKCSPPEKSNEYQKIYSNWCHEFEKYKTAMKTWERKQEQCARQNESNQNTHPNEIV-