Monarch geneset OGS2.0

DPOGS208584
Transcript: DPOGS208584-TA (2358 bp)
Protein: DPOGS208584-PA (785 aa)
Genomic position: DPSCF300064 + 1858275-1862281
RNAseq coverage: 25x (Rank: top 77%)
Annotation
Heliconius        HMEL022301           0.0     82.68%
Bombyx            BGIBMGA010594-TA     0.0     70.98%
Drosophila        kek3-PB              3e-109  45.50%
EBI UniRef50      UniRef50_Q9V430      4e-107  45.50%  IP22191p n=13 Tax=Drosophila RepID=Q9V430_DROME
NCBI RefSeq       XP_969542.1          2e-136  39.23%  PREDICTED: similar to GA18017-PA [Tribolium castaneum]
NCBI nr blastp    gi|91081311          3e-135  39.23%  PREDICTED: similar to GA18017-PA [Tribolium castaneum]
NCBI nr blastx    gi|270005207         2e-140  40.05%  hypothetical protein TcasGA2_TC007226 [Tribolium castaneum]
Group
KEGG pathway      dme:Dmel_CG4192      2e-107
                  K07523 (NGL1) maps-> Axon guidance
InterPro domain   [480-481]  IPR013783  1.2e-21  Immunoglobulin-like fold
                  [251-349]  IPR013098  2.5e-19  Immunoglobulin I-set
                  [263-339]  IPR003598  6.4e-15  Immunoglobulin subtype 2
                  [257-350]  IPR003599  4.9e-11  Immunoglobulin subtype
Orthology group   MCL18894   Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS208584-TA
ATGCTTGCGTTGGCGGTGGCCGTCAGTGCCGAATGCCCTCGTCACTGCGAGTGCAAATGGCGGAGTGGTAAAGAATCCGCGCTATGTGCTCGCGCCGGCCTGAATGCTATACCACCACGTTTAGATCCAACAACTCAGCTCTTGGATCTGGCTGAAAATCGAATATCTGTACTTAAAGATGATGCTTTTGCAGAAGCTGGGCTTCTGAATCTTCAACGTCTATACATTCCTGCCTGCAATTTAAAAAGTATAAGGCAATATGCATTTCGTGCCCTAGTCAATCTGGTGGAACTGGATCTGTCTAGAAACCGTTTAGAGACGGTGCCCTCACAGGCTTTTGAATCAATTCCTGAACTCCGAGAACTTCGACTGAGCGGTAACCCAATAGTAAAAATAAAAGATGATGCATTTTTATCCCTACCCCATTTAGTAAAATTAACCTTGAGTGATTGTAAGATAATTGAAATAGAACACAGAAGTTTTAAGGGTTTGGAAGGTTCATTAGAATATTTGGAACTTAATAAGAATAAACTTCAAATTTTACATGTTGCCATATTAGCACCCCTGAGGTCATTAAAAGGACTTGAACTTGCAAATAATCCATGGGATTGCAACTGTGCACTTAGACCTATGCGAGATTGGATGATAAGAAAAAATGTTCCCGCAACTGTTGTGCCAGACTGCGCGTTACCTCCGCGATTAACATCACAATCTTGGGATCGCTTGGATCTAGAAGATTTTGCATGTTTACCAGAAGTAACCGGAACCTTGAACAACATCAAAGGTATAGAAGGTGAAGAAGTTACCCTCATATGTCAAGTAAGTGGCGTGCCAGCACCAAGAGTGAGATGGATTCGATCGGGGCGACTTTTAGCAAATACTACTATATCAAGTAACCTCAATTCGGGTAGAATTTTTTTACTACGTAGTGAAGGACAAACAAGTAATTTAACAATAAAGTCCGCAGATATTCAAGACTCTGGACAATACACGTGTAATGCGGAAAACAGAGCTGGTAAGGCGGAAGTTGTTTTAAATTTAACCATTGAGAAAAAACCTCATAGTAGAGGTTTTGGTGGTAGAGCCCTCATGGCTGGGATGGCTGTGTCTGCAGTAATAATACTAAGTTCATGTTTAATAGGATTGTGTGCATATGAGACCCGTAAGAAACGTCAGTTAGATAGATGGAATGAACAGATTGTGACGTCGAATCATCACGATAATAATTACGAAAAAATAGAAGCTAATTTAAAAACAACCGAAGAAGTTTCAAGGGTTGTTGTTTCAGAAAATAACAGTCGAAAAAGAGGAGACTATAGAAATGTACCATCAGACGACCCAGAAGACGAATACGAAGGTTATGGAAGAGACGAAGTGAATAGAAAAAGAAGTGAGTCTTCGCCTCCACGAGACATTAGATGGCGACGTATTGAATATGACCCACCTGTTTCTAGAGCAGGTGAATTAGTTCAAGATCTTCATATACCAAGATTGAGGGAATATAACTCAGGAAACGACGCTACCAGTGAAACGATACAGACATCTAAGTCGGCTGGGCTTTTAGCAGGTCACGTTGAGATTTTATCTGAAAATAATAGGTTTAATAATAACGCTCGCTCATGTCCTCGACCACGAGCACGCGATCGCCTTGATAATGATCTGTCTGGTAGCGACAGTGAAAAGAACTATCCGGATCTAATAGAAATGAGTGCCTTGGGCACGACGTCCTATTATCGAGGTGATATAAAACATGATCCTTATTATTTCTACACAATTCCCAGGAGAAAAGATGGTGACAGCCGAAGTCCTCTATTAAGTAGCCGAAGAAATAGCTCTGGAGGTGATTCTATCACCTTTCTTGATAAAAGCTATGACAAAAGTCAACAAAGATCCAGTGGACGGAGGTCAAATAGTTTTCTAGATCTTTCGGTCGGTGGTAACAGAATGCGTCGAAATCCTAGCTTGCCAGCCTCGCCATCCAGAGAGCAGTCATCAGTTCC
GTCCGCCACACCTCTTTTGGACTTATCTGGCCTTCGAGACTATTCGAGAACTGGTCAGCCTTTCGAGGATTTTGACTTTCGTGCATCTCAGTTGGAGAAATTCCTAGAAGAATATAGAAGTTTAAGGGAACAACTTTCTAGAATGAAAGAAACAAGAGAAAATCTTCAAAGAACAAGAGCAGTTGAAAGTGAGGAGTTGCGAACAATCCACAAAGGTAAACCTAGTATAGCAGTGACAGAAACGTCATCTTCTGTAACATTAGCAGATGCAGCATCTCCGTTGGCATTAAGTCCATCAGAATATAAACCTCAACACACAAGGCCAGAATGGTTGACCACTCTACTATACCGCAACTAA

Protein sequence:

>DPOGS208584-PA
MLALAVAVSAECPRHCECKWRSGKESALCARAGLNAIPPRLDPTTQLLDLAENRISVLKDDAFAEAGLLNLQRLYIPACNLKSIRQYAFRALVNLVELDLSRNRLETVPSQAFESIPELRELRLSGNPIVKIKDDAFLSLPHLVKLTLSDCKIIEIEHRSFKGLEGSLEYLELNKNKLQILHVAILAPLRSLKGLELANNPWDCNCALRPMRDWMIRKNVPATVVPDCALPPRLTSQSWDRLDLEDFACLPEVTGTLNNIKGIEGEEVTLICQVSGVPAPRVRWIRSGRLLANTTISSNLNSGRIFLLRSEGQTSNLTIKSADIQDSGQYTCNAENRAGKAEVVLNLTIEKKPHSRGFGGRALMAGMAVSAVIILSSCLIGLCAYETRKKRQLDRWNEQIVTSNHHDNNYEKIEANLKTTEEVSRVVVSENNSRKRGDYRNVPSDDPEDEYEGYGRDEVNRKRSESSPPRDIRWRRIEYDPPVSRAGELVQDLHIPRLREYNSGNDATSETIQTSKSAGLLAGHVEILSENNRFNNNARSCPRPRARDRLDNDLSGSDSEKNYPDLIEMSALGTTSYYRGDIKHDPYYFYTIPRRKDGDSRSPLLSSRRNSSGGDSITFLDKSYDKSQQRSSGRRSNSFLDLSVGGNRMRRNPSLPASPSREQSSVPSATPLLDLSGLRDYSRTGQPFEDFDFRASQLEKFLEEYRSLREQLSRMKETRENLQRTRAVESEELRTIHKGKPSIAVTETSSSVTLADAASPLALSPSEYKPQHTRPEWLTTLLYRN-
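
Consistency check (illustrative sketch, not part of the original record): the listed lengths agree with each other, since 785 amino acids plus one stop codon correspond to 3 x 786 = 2358 bp. The short Python sketch below assumes the standard genetic code and that the transcript is a complete coding sequence in frame from its first base. The helper names `translate` and `expected_cds_length` are hypothetical. It translates only the first and last few codons copied from the nucleotide sequence above, and writes the stop codon as "-" to match the protein record.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (assumed to apply here).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds, stop="*"):
    """Translate a coding sequence in frame 0; unknown codons become 'X'."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    protein = []
    for i in range(0, usable, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        protein.append(stop if aa == "*" else aa)
    return "".join(protein)


def expected_cds_length(protein_length):
    """Length in bp of a complete CDS: one codon per residue plus a stop codon."""
    return 3 * (protein_length + 1)


# First ten and last six codons copied from the DPOGS208584-TA sequence.
first_codons = "ATGCTTGCGTTGGCGGTGGCCGTCAGTGCC"
last_codons = "CTACTATACCGCAACTAA"

print(translate(first_codons))           # MLALAVAVSA
print(translate(last_codons, stop="-"))  # LLYRN-
print(expected_cds_length(785))          # 2358
```

The two translated fragments match the start (MLALAVAVSA) and end (LLYRN-) of the DPOGS208584-PA sequence. To check the whole record, join the two lines of the nucleotide sequence before translating.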