Monarch geneset OGS2.0

DPOGS212305
TranscriptDPOGS212305-TA1614 bp
ProteinDPOGS212305-PA537 aa
Genomic positionDPSCF300286 + 281949-283721
RNAseq coverage44x (Rank: top 72%)
Annotation
HeliconiusHMEL0170010.079.66% 
BombyxBGIBMGA009335-TA9e-17181.87% 
Drosophilakek6-PA4e-5632.18% 
EBI UniRef50UniRef50_D1ZZM32e-8236.67%Putative uncharacterized protein GLEAN_08070 n=1 Tax=Tribolium castaneum RepID=D1ZZM3_TRICA
NCBI RefSeqXP_974068.13e-8336.67%PREDICTED: similar to kek1 [Tribolium castaneum]
NCBI nr blastpgi|910809296e-8236.67%PREDICTED: similar to kek1 [Tribolium castaneum]
NCBI nr blastxgi|910809294e-8437.17%PREDICTED: similar to kek1 [Tribolium castaneum]
Group
KEGG pathwaydme:Dmel_CG41925e-26 
 K07523 (NGL1)maps-> Axon guidance
InterPro domain[215-322] IPR0137831.2e-10Immunoglobulin-like fold
[167-216] IPR0004838.4e-09Cysteine-rich flanking region, C-terminal domain
[218-322] IPR0130982.5e-08Immunoglobulin I-set
Orthology groupMCL25601 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS212305-TA
ATGAGTATGCGCGTCTCTTGGGCCAGGTGGTATTGGATAGTTATCTTCACAACGTGCACCGCAAGGACGGCCAGTGATTGGCTTGATTGCGCCCACATTGTGACCTGTCGTTGCAAGTGGTCCTCGGGAAAGAAAACGGCCACGTGTGCATCCAGTGACTTACGCCGCCCACCATCTCTTTCTTCAGATATCCAAGTACTGGATTTACATGACAACCCGTTAAGAAATCTACCACAAGAAGTTTTTTCGAATATTGGGCTCCTTAATTTACAAAGGATAAATTTACGAGCAACTAAACTTAGATCTATACACGCAGATGCTTTTTTAGAATTAAGGATATTAATAGAAGTTGATTTAGCTGATAACGATTTGGCGATATTGCCACGGAATATATTCAGAGAGACGATAGACTTACGAAGAAACCAATTAACATTCCTACAACTGGCTACTTTCCAGCCTGTATCTTTGCCAGCTTTAAAGACTATGTTTTTATCTGGCAACCCATGGCGTTGTGATTGCCGATTGCGCGAATTTAAGGATTGGTTTTTGGATAGCACTTTAGGCACAGAAGAATTGGTATGTGTGGAGCCTTCATCTCAGTCTGGAAACAAATGGAGAAACGTGCCTAGTGATATAATGGCTTGCCCTCCTGAAATCAAATCAAGCACACTGGTTGTTAGAGCAGAGGTAGGTCTAGCTGCTTCTTTTGGTTGTTGGGTCCATGGAATTCCAAAGCCTTCAGTTACTTGGCTTTTAGATGGTGTGGAAATTCATAACAGTACTATCGATTGCGATATTGAGGAAACAGATACCATTATCGAAGAAAATGATGAGACCCTTGATCGAGTTGCGGGTAGCGCTAGATGGGTGAATATTACTTTATTAAATGTGACTTCAAATGCAGCTGGAGAATGGACGTGTATTGCCAATAGTGTAGCGGGAGAAGCCAGAGCAGTAATAAGCCTTGTTTTGCCTCGATCACAAACAGCTACAGCTCGAACTGCTCCAGGTATACCGCAATTATTAGGAGTGGTTTTTGGAGCTTTAGGAGCTTTAGCGACATTAGGATTTATAGCAGCTATAGCTTGCTGGCATTTGAGAAAAAGAACAGTTCCATCTAGTCGAAGTTTTATGGATCAAGAAAAGCGGTTAATAGATGCATCTGTTGTAATTAGTTGCGACCGTTCCATTGCTGATATGGCTTCACCGTGTGATTTTGAACTTACTGAAAGATCATTGTCTGTTGATGACCATCCGAGAGGGTGTAGTTTTGATCCTGTTCATATTACAATTGAAGGCACACCAGGTGCATTCCCACCACCACCAGCCGAATTTGCAGTTCCAGTTCCTTACGGCAATATCTTTATTTCTGTCCAAGTATCTGGAAGGAGTGAACCAGCAAAATATCCCGATTTGCTAAGTGGTGGTACTACGTTACCTCGAAGAAGTAGAACATGCTGTACAGCACCAGCATACGACAACATGGGTCCGAGAGTTACAGCAACAGGAAGTTCGACTTGGTCTTTACCTGGTGCTTCAACAGAAGCAGCTGAACAAACGGAAACCCCAGTCCTGACGTTACCTCCACCACCACCAGAATTTGTTTCACTTTAG

Protein sequence:

>DPOGS212305-PA
MSMRVSWARWYWIVIFTTCTARTASDWLDCAHIVTCRCKWSSGKKTATCASSDLRRPPSLSSDIQVLDLHDNPLRNLPQEVFSNIGLLNLQRINLRATKLRSIHADAFLELRILIEVDLADNDLAILPRNIFRETIDLRRNQLTFLQLATFQPVSLPALKTMFLSGNPWRCDCRLREFKDWFLDSTLGTEELVCVEPSSQSGNKWRNVPSDIMACPPEIKSSTLVVRAEVGLAASFGCWVHGIPKPSVTWLLDGVEIHNSTIDCDIEETDTIIEENDETLDRVAGSARWVNITLLNVTSNAAGEWTCIANSVAGEARAVISLVLPRSQTATARTAPGIPQLLGVVFGALGALATLGFIAAIACWHLRKRTVPSSRSFMDQEKRLIDASVVISCDRSIADMASPCDFELTERSLSVDDHPRGCSFDPVHITIEGTPGAFPPPPAEFAVPVPYGNIFISVQVSGRSEPAKYPDLLSGGTTLPRRSRTCCTAPAYDNMGPRVTATGSSTWSLPGASTEAAEQTETPVLTLPPPPPEFVSL-