Monarch geneset OGS2.0

DPOGS203819
TranscriptDPOGS203819-TA1491 bp
ProteinDPOGS203819-PA496 aa
Genomic positionDPSCF300010 + 2251291-2253641
RNAseq coverage4x (Rank: top 89%)
Annotation
HeliconiusHMEL0133314e-14257.05% 
BombyxBGIBMGA003724-TA8e-9745.79% 
DrosophilaCG12869-PA1e-4629.29% 
EBI UniRef50UniRef50_D6W7F99e-7036.33%Putative uncharacterized protein n=2 Tax=Tribolium castaneum RepID=D6W7F9_TRICA
NCBI RefSeqXP_972696.25e-7036.27%PREDICTED: similar to AGAP010390-PA [Tribolium castaneum]
NCBI nr blastpgi|2700147533e-6936.33%hypothetical protein TcasGA2_TC005165 [Tribolium castaneum]
NCBI nr blastxgi|2700147531e-6737.04%hypothetical protein TcasGA2_TC005165 [Tribolium castaneum]
Group
KEGG pathwayhsa:544135e-40 
 K07378 (NLGN)maps-> Cell adhesion molecules (CAMs)
InterPro domain[4-496] IPR0020183e-92Carboxylesterase, type B
Orthology groupMCL18796 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS203819-TA
ATGTGGTTGTGTTTAGGTGCATTTAGTGTTATATCAGCTCAAGATCCTGTAGCAAATTTGCCTATGGGCCGAATTGTAGGGATTAAGGTATTTACTGAAAATTCACTTATCCCCATTGAAGTGTTTTTTGGAATACCCTATGCGCTGCCGCCGATTGGAAGATTAAGATTTTCTCCTCCAGAAAAACATCCAGGATGGAAAAGAACTTTGTTTGCCCATCGTATGCCACCACGCTGTCCGCACCCTAGCAATGATACTCCAAACTTTAACGAAGACTGCCTCTATCTTAACATATGGACTCCTCGGCGGGTGGATGGAAAATTGCTCCCAGTCATGGTTGTATTATATAGTGAATCTTGGAACAAGGGCGGAATAACTCTACCTTGTCAGGATTTAGCTTCAGACGGGCTCGTGGTCGTTACTGTGGCATTCAGACTTAGCGTATTTGGTTTTTTTACTTTAAAATCAATTTTGGCTAGGGGAAACCTTGCTCTGCTTGATCAATATCTAGCTATAGTTTGGGTAAGGGAAAACATTGCTGCATTCGGCGGAGATCCTAATTTAATTACATTAGTTGGCCATTCAAGCGGAGCAGACAGCGTTCTTTTACATTTAGCATCACCACGAACAACAGGTCTATTTCAACGAGCTATAATAATGTCTCCGAAAAATATTTGGAAATCAATTGAAAAAGATAAAAATTCTCCGACGCAGAAGATCGTTCATTTATCAGAATCCATAACAGAATCACTGGGTTGTTTAGAAGAAACAATTCAGAAAACTTTACAGTGCTTAAGATCCCGTTCCGTGGCTGATTTCTTGGGACAATACACGAATATTTGGACCGACTTATTTGAGCCAATTCCTGATGACTTTCTACCAGAATCTGAACAATATTTACCAAAATCACTGGCAACATCCTTTTCATCAACCAGCTCCAAAAAGATTAATCTTGATGTGTTAATGGGAACTACTAATCTAGAGGCCATTGATTTAGAAAAATTCAAAAATTCTTTAAGAGACGGTCCTAAAACAAGAGAGCTTAATAATATAACTAGTATTATATCTGAAACCTTACATTTTCTTTCACTTGATCGCCCAGAAAACGAATTTATTCTTTCGCAGGCAATCTTCTGGGAATTTTTAGGATTTAAAACTCGCCAAGAAAGCGAACAGGATGTTATTGGAATTTTAGAAGATGTAGGTAGAATGGAAACATCTGCAAAATGGGGAGCAGGGTCTGCTTTGTTGGCGGCAAAGTTTGCACGAAAAGTATCTCGATTGTATGTGTACCGTTTCTTGCAACCTAACTATGTGGACTTGCATGGCTTCCAACTGAATTTTACAGGTGCTACAAACGGTGCCGAGTTATTTGCATTGTTGGGAGACGCTCTCATGCTCCAAGCAGCACGACGTCCGTTTTCACAGACCGAAAAAAGAATATCATTGAGATTTCGAAGCTTTATTTCAAATTTTGCCAAATTTGGGTAA

Protein sequence:

>DPOGS203819-PA
MWLCLGAFSVISAQDPVANLPMGRIVGIKVFTENSLIPIEVFFGIPYALPPIGRLRFSPPEKHPGWKRTLFAHRMPPRCPHPSNDTPNFNEDCLYLNIWTPRRVDGKLLPVMVVLYSESWNKGGITLPCQDLASDGLVVVTVAFRLSVFGFFTLKSILARGNLALLDQYLAIVWVRENIAAFGGDPNLITLVGHSSGADSVLLHLASPRTTGLFQRAIIMSPKNIWKSIEKDKNSPTQKIVHLSESITESLGCLEETIQKTLQCLRSRSVADFLGQYTNIWTDLFEPIPDDFLPESEQYLPKSLATSFSSTSSKKINLDVLMGTTNLEAIDLEKFKNSLRDGPKTRELNNITSIISETLHFLSLDRPENEFILSQAIFWEFLGFKTRQESEQDVIGILEDVGRMETSAKWGAGSALLAAKFARKVSRLYVYRFLQPNYVDLHGFQLNFTGATNGAELFALLGDALMLQAARRPFSQTEKRISLRFRSFISNFAKFG-