Monarch geneset OGS2.0

DPOGS207403
Transcript: DPOGS207403-TA, 2064 bp
Protein: DPOGS207403-PA, 687 aa
Genomic position: DPSCF300087 - 351606-353963
RNAseq coverage: 1486x (Rank: top 9%)
Annotation
Heliconius: HMEL021479, 0.0, 80.71%
Bombyx: BGIBMGA009370-TA, 0.0, 73.68%
Drosophila: CG16974-PA, 4e-137, 41.45%
EBI UniRef50: UniRef50_D2A022, 0.0, 53.69%, Putative uncharacterized protein GLEAN_08146 n=3 Tax=Endopterygota RepID=D2A022_TRICA
NCBI RefSeq: XP_975439.1, 0.0, 54.00%, PREDICTED: similar to AGAP008611-PA [Tribolium castaneum]
NCBI nr blastp: gi|91081067, 0.0, 54.00%, PREDICTED: similar to AGAP008611-PA [Tribolium castaneum]
NCBI nr blastx: gi|91081067, 0.0, 53.92%, PREDICTED: similar to AGAP008611-PA [Tribolium castaneum]
Group
KEGG pathway: dre:553785, 3e-26
 K07523 (NGL1) maps-> Axon guidance
InterPro domain: [254-367] IPR013783, 1.7e-13, Immunoglobulin-like fold
 [267-366] IPR013098, 2.1e-09, Immunoglobulin I-set
 [264-356] IPR003598, 1.7e-07, Immunoglobulin subtype 2
Orthology group: MCL16525, Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS207403-TA
ATGATTAGGTTGAAGAAGTTGGATTTATCTTATAATAGGATCGCCACTTTCTTAGATTACTTTTTTAAGCCTAATAGCCAACTGAAAACTCTGTTCCTAAATAATAATAGTCTAGTTAAAATTACATCTCATTCTCTAGTCAGCCTTAAGGAATTGGAAACTCTCGATCTATCTAGCAACAAATTAGACAATATTCCTAAATCTATATTCGACAGCCTGGAACAGTTGAAGGATATCAATCTAGGTTACAACAACTTCGATAACATATCACAGGATATATTTAAAAAGCTGAATAAGTTAGAGACGCTTAATTTAGGTGGAAATAGACTTAAGGTGTTGCCCCCGACGCTGTTCCAATATAACGAAAACTTACTTGCAGTCTACCTCGAACACACGGCAATTACCGTCATACAGAACACCAACTTTAAAGGCTTACAGAAACTCAAGCGTCTGTACGTCAGGTACAACTCCATGTTACGGGAAATAGAACCCTTCGTGTTCCAAGACACGCCGGCCTTGACTCATCTGGACATTACTGCCAACGCTCTGACATACCTGCCGCTATCACTGCAGATGTTAGAGAACTTACAGGAGTTGAGAATAGGGAATAACTCGTGGGCCTGTGACTGTAGGATGGCCTGGTTCGTCAGCTGGATAGAGAACAGAAAAGATATCGTGAGGTCGGATTTGAGTTGTCGGTGGGCTTACAGGGACGACATGCTGAGGTATCTGAACAGCACCAACTGTAAACCGCCACAATTGATAAAGAGCAGTCCACTGTCGTTACACAGGCTCCAGACGGACGCACTGCTGGAGTGCAAGTTCGCTGGCAACCCAGCCCCATCCATAACCTGGATAACGCCAACGAGAAACGTGTTCCATTGGAATCCAGATTCATCCCCGCCGGACATATTCCAGAAACATGGAATAGCACACGATCAGTATTACCGCCCCTTAGATTACAGTAACTCGAGAGTCAAAATCCGCGACGACGGTTCCCTCTTCATAGCCGACATCCATCGTGAGGACAGCGGCACCTACCTGTGCCTGGCCTCGAACCCCTCGGCCAACGCCACCGCCGAGGTCGTCCTCAACATAGATCCCATGACCATGTTAGAAATCAAAATCTATAGTTTGCTGTTCGGAGCGATGTGTGCACTGAGTTTCTTGGGTCTCACTTTGCTAGTGCAGGCTCTGCGTTATATATTCTATAGATATCGTCTCCTAGAGACCTGCTGTAGCTGTTGTACTTGTGTGAATCGCGACGCGCCGCGAACCAGACAGATATATCACATGCTGGACAACATCGAACAATATAAGAGACAACAATTGGAAAAACTCCGAGAAAACTACGCCGTTCAGGTACATCGAATCAAAGAGAACTGCACCCAGCAGATGGAGTGGATTCAGAGTAGCTACTCGACACAAGCGGCTCACTTGAGAAATATCAGAGATATCGGAACCAACCATTTGACGTCTATGAAAGATCAGTACTACGATCAGGTGAAACGCGTCCGCGAGTACTCCACGTGCCAACTGAACTGGGTCAGAGAGAACTACGTGTTCCAGAGGAACAAGATACGGAAATTCAGCGCGCACCAGATACTGAGGCTGAGAGAGTCGTACAAATATCAGCAGCAGACGCTCAACAAAGTGCTGGAGAACCTGCCCAGTCTGTACTTCGAGAACTGCAGGAGCGGTTCGTGCGGCCGGAGCGACTCCATGGCCTTCGACCCCGACGTGGAGGTCATCGACATGTACCTCAAGACCAAGATCGAGAAGCTGGCGCGTCTGCCGCCGCACTTCGACGACGAGAGCAAGGTGTCGGTGTACTACACGCCCACCGAGCGCTCGCTGGAGTCGCGGCGGGGCTCGCTGGCGCGCGCGGAGCCCGAGGCCGTCCACATCAACATGATCGAGCGCCCCCCTTCGTGTGTCGTGCGAGTCGGTAGTGTCGTTTGTTGTGACGGTGTGTGCACTGGCCGACGGCCCTCCCCCCC
TCTCCCCTGTCGGCCTGTCGGTGTCCTTATTATTATTACTATTAGTCGTAGTGTCACCGTCTAA

Protein sequence:

>DPOGS207403-PA
MIRLKKLDLSYNRIATFLDYFFKPNSQLKTLFLNNNSLVKITSHSLVSLKELETLDLSSNKLDNIPKSIFDSLEQLKDINLGYNNFDNISQDIFKKLNKLETLNLGGNRLKVLPPTLFQYNENLLAVYLEHTAITVIQNTNFKGLQKLKRLYVRYNSMLREIEPFVFQDTPALTHLDITANALTYLPLSLQMLENLQELRIGNNSWACDCRMAWFVSWIENRKDIVRSDLSCRWAYRDDMLRYLNSTNCKPPQLIKSSPLSLHRLQTDALLECKFAGNPAPSITWITPTRNVFHWNPDSSPPDIFQKHGIAHDQYYRPLDYSNSRVKIRDDGSLFIADIHREDSGTYLCLASNPSANATAEVVLNIDPMTMLEIKIYSLLFGAMCALSFLGLTLLVQALRYIFYRYRLLETCCSCCTCVNRDAPRTRQIYHMLDNIEQYKRQQLEKLRENYAVQVHRIKENCTQQMEWIQSSYSTQAAHLRNIRDIGTNHLTSMKDQYYDQVKRVREYSTCQLNWVRENYVFQRNKIRKFSAHQILRLRESYKYQQQTLNKVLENLPSLYFENCRSGSCGRSDSMAFDPDVEVIDMYLKTKIEKLARLPPHFDDESKVSVYYTPTERSLESRRGSLARAEPEAVHINMIERPPSCVVRVGSVVCCDGVCTGRRPSPPLPCRPVGVLIIITISRSVTV-