Monarch geneset OGS2.0

DPOGS208476
Transcript: DPOGS208476-TA | 1941 bp
Protein: DPOGS208476-PA | 646 aa
Genomic position: DPSCF300064 - 1444535-1446475
RNAseq coverage: 77x (Rank: top 65%)
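As a quick consistency check on the summary fields above, the genomic span can be compared with the transcript length. The sketch below assumes the coordinates are 1-based and inclusive, which the record does not state; `parse_position` is a hypothetical helper name, not part of any database tool.

```python
import re

def parse_position(text):
    """Split a 'SCAFFOLD - START-END' string into its parts.

    Assumes 1-based, inclusive coordinates (an assumption, not stated
    in the record), so the span length is end - start + 1.
    """
    match = re.fullmatch(r"\s*(\S+)\s+-\s+(\d+)-(\d+)\s*", text)
    if match is None:
        raise ValueError(f"unrecognised position string: {text!r}")
    scaffold = match.group(1)
    start, end = int(match.group(2)), int(match.group(3))
    return scaffold, start, end, end - start + 1

scaffold, start, end, span = parse_position("DPSCF300064 - 1444535-1446475")
print(scaffold, span)
```

Under that assumption the span is 1941 bases, the same as the listed transcript length, which would be consistent with a transcript that has no introns within this interval.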
Annotation
Heliconius: HMEL014650 | 0.0 | 86.07%
Bombyx: BGIBMGA010659-TA | 0.0 | 80.09%
Drosophila: kek1-PA | 1e-147 | 44.73%
EBI UniRef50: UniRef50_D2A216 | 2e-171 | 51.23% | Kekkon-1 n=1 Tax=Tribolium castaneum RepID=D2A216_TRICA
NCBI RefSeq: XP_973226.1 | 4e-172 | 51.23% | PREDICTED: similar to kek1 [Tribolium castaneum]
NCBI nr blastp: gi|91081765 | 7e-171 | 51.23% | PREDICTED: similar to kek1 [Tribolium castaneum]
NCBI nr blastx: gi|91081765 | 5e-174 | 50.38% | PREDICTED: similar to kek1 [Tribolium castaneum]
Group
KEGG pathway: dme:Dmel_CG4192 | 5e-69
 K07523 (NGL1) maps-> Axon guidance
InterPro domain: [261-362] | IPR013783 | 1.6e-17 | Immunoglobulin-like fold
[263-360] | IPR013098 | 2.9e-12 | Immunoglobulin I-set
[268-361] | IPR003599 | 1.1e-09 | Immunoglobulin subtype
[213-261] | IPR000483 | 1.5e-06 | Cysteine-rich flanking region, C-terminal domain
Orthology group: MCL16544 | Insect specific

Nucleotide sequence:

>DPOGS208476-TA
ATGAAGTGGTTCGTAGTAGCAGTGACTGTGATGGTGACGAAAACTATCTTAGCTGGGATTGCGCCTCCTGGATCATGTCCAGCTGTTTGTGGGTGCAAATGGAAAGGTGGAAAACAGACCGTGGAATGTGTAGATCGAGCTCTAATAACTATACCGGAGCCCGTCGATCCTGCTACGCAAGTACTTGACCTTTCAGGGAATAACTTACAGATTCTTCCTCAAGAAGCTTTTGCAAAAACGGGCCTCGTTAACCTTCAGCGAGTATATCTCCGAAACTGTAATATAGGGCAAATAAACGACCGGGCATTTAAAGGTCTAACTAACTTAGTGGAATTGGATCTCTCTTATAATTTACTCACTCAGATACCTTCAAATAGCTTCAAGGATGCGCCTTTTCTTAGAGATCTTATGCTCTCTAATAACCCTATATTGAAGATACATTCGGAATCTCTCCATAATTTGGGAAGTATCGTTAAGCTGGATTTGTCAAGATGTGATATTAGAGATATTGCAGCAGATGCATTTAAAAACTTACATTCATTAGAATCTTTAAAGCTGAATGGTAATAGTTTGCGTGACTTACCTTTAACCTCCTTAGAAAAGCTAGAAAAACTACGAGTTATTGATTTATCTGAAAATCCTTGGACATGTACTTGCCGGCTACGTGACTTAAAAATGTGGCTTTCAAAACATAAGCTATTCTCATCTCCCAGTTGTTCATCACCGAGTCGTCTTGCAAATAAACCATTTTCGGAACTATCTCTTGAGGAATTTGCTTGTAAACCAGAAATATTACCCATTAATAGATATGTGGAAGCAACTGTAGGAGAAAATGCTACAATTGTTTGCAGAACAGAAGCTATTCCAAGTCCTAATATAAATTGGTATTGGAATGGACGCCTTCTACAAAATGGAAGTAGTTTCAATTCACATCAAAGAATCTTTATTTATGAAGCTGGTGACCGGAAAAAAAGATCCACTCTCGTTATAACGAATACACAAGACACAGACTTTTCTGAATTTTATTGTGTTGCCGAGAACAAAGCAGGGAATGCTGAAGCTAACTTCACAATACATGTCACTCAAATGACTGCTGGAATGGCGTCTTTAGGAAGCGCCCAAATTGCGAGCCTGGGAGCTGCATTGTTCTTAGTTATAATTGTGATTTCATTAGGACTGCTTATTACTTTCGTGCGATTCCGGCCGGCTCCGGCTTGTGAAAGTAAAACGCCAAATACGTTAGACAGAGTTGTATCTGGCAATGAAGTTCATCCTACCGTAACAGACAGGCCCCATGTAGCTGTGTTGGCTAACAGACAAGAATCTCCAAATTATGATGAAACAAAATGCAACCCAGTACTTAAGCCTCCAAGAACGAATGATATACCTTATACAACTAATCATTATGAAGGTCGGGGTAGCTTAGTCCAGGCAATGGGACCGCCGGTTGTATCTCCCACCGTCTCCGGAGGTATCGATCCCGACCTCATTAACGATACGAGACCCGATAGCGTCAATCGACCAGGTAGCGGAGAATATGCGCGTGAAGCTTCCGATTCATTATATCCATCCGGTCTGTGGGATCAAATAAAAATGAATCAGGCTAACAACTTGGCACGAGCGATCAGTTCCGCAATTCCAGCGTATTATAACGATCGCACACCCATAATTGAAAACAGTAGTGTTAACGGCTCTCAAGAAGAGCTTGGCTATATGAGTCGTACCTTCCCACGATCCCATGCAATCGCGGCAGCTAGTACAGCACCAGGTGATGCTCCTTATCCTGCTGATTATGGATTGCCAGTTGGCGGTGCGCGCACACTACGTGTGTGGCAGCGTGCTCCGCCAGTGCTACCCCCAGTCTCGGCATTGAAACGTGTGCTAACTATCACGAGGCCATCGGAGGATGGTTTTCAGGATGGCTGTGCTACAGATGTTTAA

Protein sequence:

>DPOGS208476-PA
MKWFVVAVTVMVTKTILAGIAPPGSCPAVCGCKWKGGKQTVECVDRALITIPEPVDPATQVLDLSGNNLQILPQEAFAKTGLVNLQRVYLRNCNIGQINDRAFKGLTNLVELDLSYNLLTQIPSNSFKDAPFLRDLMLSNNPILKIHSESLHNLGSIVKLDLSRCDIRDIAADAFKNLHSLESLKLNGNSLRDLPLTSLEKLEKLRVIDLSENPWTCTCRLRDLKMWLSKHKLFSSPSCSSPSRLANKPFSELSLEEFACKPEILPINRYVEATVGENATIVCRTEAIPSPNINWYWNGRLLQNGSSFNSHQRIFIYEAGDRKKRSTLVITNTQDTDFSEFYCVAENKAGNAEANFTIHVTQMTAGMASLGSAQIASLGAALFLVIIVISLGLLITFVRFRPAPACESKTPNTLDRVVSGNEVHPTVTDRPHVAVLANRQESPNYDETKCNPVLKPPRTNDIPYTTNHYEGRGSLVQAMGPPVVSPTVSGGIDPDLINDTRPDSVNRPGSGEYAREASDSLYPSGLWDQIKMNQANNLARAISSAIPAYYNDRTPIIENSSVNGSQEELGYMSRTFPRSHAIAAASTAPGDAPYPADYGLPVGGARTLRVWQRAPPVLPPVSALKRVLTITRPSEDGFQDGCATDV-
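
The two records above can be cross-checked by translating the nucleotide sequence and comparing it with the protein sequence. The following is a minimal sketch using only the Python standard library. It assumes the standard genetic code and writes the stop codon as `-`, matching the trailing `-` in the protein record; `translate` is a hypothetical helper name, and only short excerpts from the start and end of the transcript are used here.

```python
from itertools import product

# Standard genetic code in the conventional TCAG ordering
# (assumption: the standard code applies to this nuclear gene).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}

def translate(cds, stop_symbol="-"):
    """Translate an in-frame coding sequence, one codon at a time."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)

# First eight codons and last five codons of DPOGS208476-TA.
first = translate("ATGAAGTGGTTCGTAGTAGCAGTG")
last = translate("GCTACAGATGTTTAA")
print(first, last)
```

To check the whole record, pass the full DPOGS208476-TA sequence to `translate` and compare the result with DPOGS208476-PA. The listed lengths already agree with each other: 646 codons plus one stop codon is 647 × 3 = 1941 bases.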