Monarch geneset OGS2.0

DPOGS205028
TranscriptDPOGS205028-TA3633 bp
ProteinDPOGS205028-PA1210 aa
Genomic positionDPSCF300288 + 155003-166319
RNAseq coverage12x (Rank: top 83%)
Annotation
HeliconiusHMEL0071490.082.70% 
BombyxBGIBMGA010362-TA0.064.13% 
DrosophilaDscam-PBX5e-6572.30% 
EBI UniRef50UniRef50_G6CLL70.088.95%Dscam n=5 Tax=Pancrustacea RepID=G6CLL7_DANPL
NCBI RefSeqXP_002080407.10.048.94%GD10266 [Drosophila simulans]
NCBI nr blastpgi|1955811700.048.94%GD10266 [Drosophila simulans]
NCBI nr blastxgi|1955811700.048.38%GD10266 [Drosophila simulans]
Group
KEGG pathway 
InterPro domain[2-94] IPR0137833.8e-10Immunoglobulin-like fold
Orthology groupMCL25730 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS205028-TA
ATGAAGGAACCCCCAAATCGAGTTGACTTCAGCAACACGACCGGAGCCGTTGTCGAATGTGCCGCACGAGGATCCCCCGCCCCTGACGTCATCTGGGTACGAGCTGATGGAACCGCTGTTGGTGATGTCCCCGGATTGAGACAGGTTCTGCCGAACGGTAACCTGGTATTCCCTCCATTCAGAGCTGAAGATTATCGTCAGGAAGTACACGCTCAGGTATACGCTTGCTTGGCAAGAAACCAGGTTGGAACCATTCACTCCAGAGACGTCAACGTGCGAGCTGTGGTCGCGCAACACTATGATACCGACGTTAACAAGGAGTACGTGATAATGGGAAACAGCATTATACTTAAATGTCAGGTCCCCTCCTTCGTGGCTGATTTCATTGAAGTTCTATCGTGGCATACCGATGAGAAAGAAGATTTTTACCCTGGCGAAAATTATGTTGTTCAGCAACAGTATGAGTCTGAAGTAAACAACGAATATGTGATTAGAGGAAATTCCGCTATTCTCAAATGTTCCATACCATCATTCGTGGCTGATTTTGTCAATGTCATATCGTGGCATGATGAAGCCGAGAACTCGTATACTATAAATGGAACAAAAGAGGGAGAGGTTGTCACCCAATATTATGAAGCTGAAGTAGTCTCAGAATATGTAATTCGTGGAAATACTGCGGTATTGAAATGCAACATCCCTTCTTTTGTTGCTGATTTTGTTAAAGTAGAAGCTTGGGTTGACTCTGATGGTGGCGAATATTTGCTAACCGATGATATCGTTGTTAATCAATTTTATGAAGCCGAAATCCTTACTGAGTACGTTATAAGAGGCAACAGTGCTGTTTTGAAATGTTCAATCCCATCGTTTGTGGCGGATTTTGTCAAAGTTGAGGCATGGATTGACGAAGAAGGAACTGAAATAACGCTCATTGATAATCTTGTTGTGTCACAGTATTACGTGACTGAAGCCGAAAACGAGTACGTGATTAGAGGGAACGCTGCTATTGTTCACTGCAAGATTCCTTCATTTGTTAGTGACTTTGTTTACGTCGAATCTTGGATTATGGATGATGGCGAAATACTTATGATTAGTAATACAAATATGACCGTGGTATCACAACCGTATGAGGCTGAAGCTGACAACGAATATGTCATAAGAGGAAATGCTGCTATAATGAAATGTGAAGTGCCAAGCTTTGTATCTGACTTTGTCTATGTCGAAATGTGGACAGACAGCGACGGTGGTACTTACTTTCCAGGAAATGCAGAGGCGGTGCTCCAAGTGTACGAAGCTAGAGTCAACGATGAGTTTGTGTTACGAGGAAACACTGCCATTTTAAAATGCATCGTGCCTTCTTTCGTAGCAGACTTTGTCTATGTTGTAGCGTGGTTGATGGATAATGAGACGGTCACTGCCAATGAAAACACTAATATCGACTCCGTCGTTCATCAAAATTACGAACCACGCGTTATCGATGAAGATGTACTACGTGGCAATTCAGCAATTGTTAAGTGTCTAATCCCGTCATTTGTAGCTGATTATGTACAAGTTGTTGAATGGTTAACCGACGAAGAATCGCTATCGGTATTTTCGCCGAATGACCCCGAAGGCAATTATGCTGTAAATCAATTCTACGAGTCGCAGGTTTATGATATATATGTTATACGTGGCAATGCCGCAGTTTTCAAGTGCCATATTCCATCTTTCGTATCTGATCACGTGCAAGTACTTTCTTGGCACGATAGTGAAGGGGGAGAATACTCATTAACCGAAAATTATGTTGTGTCACAGGCGTATACTGTAAACTTAGTCGAAGAAAATGTTTTACGTGGTAATGCCGCCATTTTCAAATGCCTTATTCCAAGTTTTGTAACGGAATATGTCGCCGTTTCCTCTTGGATAATATCTGAGGGAGATGATGAAACAGAAATTCAATCAAACGATTTAAACAAAGAGGTCGTTACGCAAGCCTACACGGTTAATCTAATGGAAGAAAGTGTATTACGTGGCAATGCCGCTATATTAAAATGCCACATCTCAACTTTCGTCACTGAATATGTCAGTGTATCGTCTTGGATTATTTCTGAAGCTGATGTAGACGAGCTAGAAATTAAAGCCGAGTCGAATGATTTGGTTGTATCTCAAAGCTATACTGTTAACCTTTGGGAGGAAAACGTTTTACGAGGCAATTCGGCTATACTGAAATGCCACATTCCAAGCTTTGTTACTGAGTACGTCACTATTACGTCTTGGATAATTTCTGAAGGAGATACCGAGGAGTTGGAAATTAATTTAGATTCAGACATTTTATTAGTCGTTTCCCAAGCATATGATGTAAAATTTTGGGAAGAATATGTTTTACGCGGAAATGCTGCTATCCTTAAATGTCAAATTCCCAGTTTTGTTTCTGAATACGTGTCTGTTTCTTCTTGGATAATATCAGAAGATGAAATAGAAAAAGAAATTAAGTTAGACGAATCCACTGATTTAGTGGTTTCTCAAGCGTATGCTGTTAATCTAATGGAAGAATATGTCCTTAGAGGAAATGCAGCTATTGTGAAATGTCACATTCCAAGTTTTGTCTCGGAATACGTCACTGTTGTATCATGGATTGTGAGTGAGGGTGAAGAAGAGGTTGAAATAAAGCCTGATTCTAATGATAAGTTAGATGATGGAAAATATTTGGTACTGCCATCTGGCGAATTACATATCCGTGATGTTGGACCTGAGGATGGCTACAAATCATACCAATGTAGAACTAAACACAGACTTACTGGAGAAACTCGATTGTCAGCAACTAAAGGACGTTTAGTTATCACTGAACCAATGGGCAGCGCTGCTCCAAAAGTAGCGTCGAAAATGATCGATATAACTGAAACGACAATCAATAGTGCTAGTACATTGCTTTGTATGGCTCAAGCCTTCCCCGTCCCAGTGTTCAGTTTCCAACAATGGATAATTCTAGAGGCTTCAGTGCGAGTGTTAATGAAAGCGTTACTTTGTTGTGTCCTGCGCAAGCGTTTCCAGTTCCCCTATCCAGTAATGAAAGCGTTACGTTGTTGTGTCCTGCACAAGCTTTTCCAGTTCCTGTGTCCAGAGCCTATTAACAGTGCCCCACCGAAAGTACCAACTAAAACAATAGAATTCTTGGAGTTTGCGATGCGATCCAGTATTACCTTACTTTGTTTGGCTCAAGCGTATCCTGTGCCTGTTTTTAGAGTCCCTCATTTTCGAGCGGTTCAAAACTGGCATGGTTTGAATTATCCGCTATGGAAGATTTTGCCCTTTTATGTCCAGCGCAAGGGTTTCCTGTCCCAGTTTTTAGAACCGATTGGATCCAAATCGCCTACATTTTCAACAGACAATAAGCTCTCTTGGTATGTAAGAATAGTGGGTCAAAGCTTAGATCTAGCATGTCCCGCTCAAGCATTCCCAGTTCCAGTTTTCAGGTACTTAACTACACATTTCAAGGTTAGGCTCCAGGATATCCCTTGCCATTATACAGAACCAATCGGATCTAAATCTCCTACTTTTTCGACGGATGATAAACTTTCTTGGTATGTACGGACGCTCAATCAAAGCATAGATTTGGTGTGTCCAGCGCAAGCCTACCCTGTACCAGTGTTCAGGTGA

Protein sequence:

>DPOGS205028-PA
MKEPPNRVDFSNTTGAVVECAARGSPAPDVIWVRADGTAVGDVPGLRQVLPNGNLVFPPFRAEDYRQEVHAQVYACLARNQVGTIHSRDVNVRAVVAQHYDTDVNKEYVIMGNSIILKCQVPSFVADFIEVLSWHTDEKEDFYPGENYVVQQQYESEVNNEYVIRGNSAILKCSIPSFVADFVNVISWHDEAENSYTINGTKEGEVVTQYYEAEVVSEYVIRGNTAVLKCNIPSFVADFVKVEAWVDSDGGEYLLTDDIVVNQFYEAEILTEYVIRGNSAVLKCSIPSFVADFVKVEAWIDEEGTEITLIDNLVVSQYYVTEAENEYVIRGNAAIVHCKIPSFVSDFVYVESWIMDDGEILMISNTNMTVVSQPYEAEADNEYVIRGNAAIMKCEVPSFVSDFVYVEMWTDSDGGTYFPGNAEAVLQVYEARVNDEFVLRGNTAILKCIVPSFVADFVYVVAWLMDNETVTANENTNIDSVVHQNYEPRVIDEDVLRGNSAIVKCLIPSFVADYVQVVEWLTDEESLSVFSPNDPEGNYAVNQFYESQVYDIYVIRGNAAVFKCHIPSFVSDHVQVLSWHDSEGGEYSLTENYVVSQAYTVNLVEENVLRGNAAIFKCLIPSFVTEYVAVSSWIISEGDDETEIQSNDLNKEVVTQAYTVNLMEESVLRGNAAILKCHISTFVTEYVSVSSWIISEADVDELEIKAESNDLVVSQSYTVNLWEENVLRGNSAILKCHIPSFVTEYVTITSWIISEGDTEELEINLDSDILLVVSQAYDVKFWEEYVLRGNAAILKCQIPSFVSEYVSVSSWIISEDEIEKEIKLDESTDLVVSQAYAVNLMEEYVLRGNAAIVKCHIPSFVSEYVTVVSWIVSEGEEEVEIKPDSNDKLDDGKYLVLPSGELHIRDVGPEDGYKSYQCRTKHRLTGETRLSATKGRLVITEPMGSAAPKVASKMIDITETTINSASTLLCMAQAFPVPVFSFQQWIILEASVRVLMKALLCCVLRKRFQFPYPVMKALRCCVLHKLFQFLCPEPINSAPPKVPTKTIEFLEFAMRSSITLLCLAQAYPVPVFRVPHFRAVQNWHGLNYPLWKILPFYVQRKGFLSQFLEPIGSKSPTFSTDNKLSWYVRIVGQSLDLACPAQAFPVPVFRYLTTHFKVRLQDIPCHYTEPIGSKSPTFSTDDKLSWYVRTLNQSIDLVCPAQAYPVPVFR-