Monarch geneset OGS2.0

DPOGS200984
Transcript: DPOGS200984-TA, 1305 bp
Protein: DPOGS200984-PA, 434 aa
Genomic position: DPSCF300147 - 386083-390797
RNAseq coverage: 1888x (Rank: top 7%)
Annotation
Heliconius: HMEL007497, 3e-71, 76.96%
Bombyx: BGIBMGA009053-TA, 1e-174, 71.93%
Drosophila: sl-PA, 7e-07, 36.47%
EBI UniRef50: UniRef50_D6WUR8, 6e-94, 47.10%, Putative uncharacterized protein n=1 Tax=Tribolium castaneum RepID=D6WUR8_TRICA
NCBI RefSeq: XP_973345.1, 1e-94, 47.10%, PREDICTED: similar to predicted protein [Tribolium castaneum]
NCBI nr blastp: gi|91089619, 2e-93, 47.10%, PREDICTED: similar to predicted protein [Tribolium castaneum]
NCBI nr blastx: gi|91089619, 3e-95, 47.21%, PREDICTED: similar to predicted protein [Tribolium castaneum]
Group
Gene Ontology: GO:0005515, 7.5e-20, protein binding
KEGG pathway: gga:425978, 2e-13
  K08273 (SH2D2A, VRAP) maps-> VEGF signaling pathway
InterPro domain: [327-425] IPR000980, 7.5e-20, SH2 motif
Orthology group: MCL18585, Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS200984-TA
ATGCTGCAACAGATCTTACGAGAGATGTGGGTTGATCCTGAAATCCTTGCTGAGCTCGACGAGACACAAAAACAGACGCTCTTCTGCAAGATGAGAGAGGAACAGGTGCGTCGATGGCAGGAGTGGGATAAAAAGGTCGGAAAACATGAGAATGAAACACGAAACCAGCAGAATGGAAAGAAGCAGGTGCAGTTCCTTAAAGGGGAGGACGGAGAGCCCTGGGTTTGGGTGATGGGAGAGCATCCAGACGATAAGTCCATCGAGACGATCCTGTGTGAGGAGACGAGAGCTCGGGCGGTCGCGCAGGCGAGGGTGGAGGCTCACCAACTGAGGAAGTCCGTGGAGAAGGAACTCACACATCTGATAGACTACAAGCCTCTCGATAGCTTGGAGCAGAAATTTGACATATCTCCAAAGAACGTGGATTCGTTGGAAGATACTTTGGATCTATATTGTTCAGTGGACGAGTTGAGGCAGAGGATCGAGGAATTGGAGCCGGAGGTGAAGGAGAAAAGCGACAGTGAGGCTGACAAAGATTACTGCGAGGAGCATGGCAAAATGAACCTTAAGAAGAACACTTTACAGTTCAATTTCATTGAAGGGAAGAAAGATGTTATACAAGACTCCGGGCGCGCTGGTGACGGCGTGTCCTTGCGTGTGGCGGCCTGGGAGAGGAGGGTTGCCGCGGCCAGGGCCGGGGACATCCTGCGGGGGCTCCGGGCCAGGAGGGCTCGCACGCTCAGGGATGCGCAGGCGCAGGCTCAAGGAGGAGACGCGCTGTGGAGGGAACAGGAGCGTAAGGCTAAGGAGGCGGAGGCTGCTATGAGGGAGATCGCTCGAGCGGCCCGAGAGGTTCACCGGCGGACATCTCACCTAGCGCCCGCGTGCGCCCTGCCCGCGAGCAAACCACCTAACAGAGAAGCCGTTCTGGACTGGTTCAAAACCAAGGAACTGCCGAAAGGGGTCGGCCTGGACGAGAACCACAAACCAGTCGACTGGTTTCATGGTCTCATCAGTCGCTGCCAGGCGGAGCAGCAGCTGCAGCGCTCGGCGGCGGGCAGCTTCCTGGTGCGCGTGTCGGAGCGGGTGTGGGGTTACGCCATCTCGTACCGCGGGGAGCGCACCAAGCACTACCTGGTGGACGCCGCGGACGGGTACAGTCTGCTGGGGGCGGGCCAGCTCCGACACGAAACGCTGGCCGATCTTATAAACTACCACAAGAGAGTGCCCATCACGGAGAGCGGCGGCGAGTTGCTGACGACGCCGTGCGCGCCCTCTGGCGACAGAGAGTCACACCTAATGTAA

Protein sequence:

>DPOGS200984-PA
MLQQILREMWVDPEILAELDETQKQTLFCKMREEQVRRWQEWDKKVGKHENETRNQQNGKKQVQFLKGEDGEPWVWVMGEHPDDKSIETILCEETRARAVAQARVEAHQLRKSVEKELTHLIDYKPLDSLEQKFDISPKNVDSLEDTLDLYCSVDELRQRIEELEPEVKEKSDSEADKDYCEEHGKMNLKKNTLQFNFIEGKKDVIQDSGRAGDGVSLRVAAWERRVAAARAGDILRGLRARRARTLRDAQAQAQGGDALWREQERKAKEAEAAMREIARAAREVHRRTSHLAPACALPASKPPNREAVLDWFKTKELPKGVGLDENHKPVDWFHGLISRCQAEQQLQRSAAGSFLVRVSERVWGYAISYRGERTKHYLVDAADGYSLLGAGQLRHETLADLINYHKRVPITESGGELLTTPCAPSGDRESHLM-
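
The lengths quoted in this record can be cross-checked against each other. The sketch below is a minimal consistency check, not part of the original record. It assumes that the transcript is coding sequence only (no UTRs) and ends in a single stop codon, and that the InterPro domain coordinates are 1-based and inclusive. The function names are hypothetical.

```python
def cds_matches_protein(transcript_bp: int, protein_aa: int) -> bool:
    """Return True if the transcript is exactly the protein's codons plus one stop codon.

    Assumption: the transcript contains only coding sequence (no UTRs).
    """
    return transcript_bp == 3 * (protein_aa + 1)


def domain_within_protein(start: int, end: int, protein_aa: int) -> bool:
    """Return True if a domain range lies inside the protein.

    Assumption: coordinates are 1-based and inclusive.
    """
    return 1 <= start <= end <= protein_aa


# Values taken from the record above.
print(cds_matches_protein(1305, 434))        # True: 3 * (434 + 1) = 1305
print(domain_within_protein(327, 425, 434))  # True: SH2 motif fits inside the protein
```

Under these assumptions the 1305 bp transcript and the 434 aa protein agree, and the SH2 motif at [327-425] falls within the protein.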