Monarch geneset OGS2.0

DPOGS208937
Transcript: DPOGS208937-TA (2598 bp)
Protein: DPOGS208937-PA (865 aa)
Genomic position: DPSCF300009 + 120712-129354
RNAseq coverage: 545x (Rank: top 23%)
Annotation
Heliconius: HMEL004766 | 1e-134 | 68.85%
Bombyx: BGIBMGA002409-TA | 0.0 | 81.07%
Drosophila: (no entry)
EBI UniRef50: UniRef50_E0VS10 | 3e-159 | 52.66% | Angiomotin, putative n=1 Tax=Pediculus humanus corporis RepID=E0VS10_PEDHC
NCBI RefSeq: XP_002428904.1 | 6e-160 | 52.66% | angiomotin, putative [Pediculus humanus corporis]
NCBI nr blastp: gi|242016775 | 1e-158 | 52.66% | angiomotin, putative [Pediculus humanus corporis]
NCBI nr blastx: gi|383853920 | 3e-161 | 49.24% | PREDICTED: uncharacterized protein LOC100883025 [Megachile rotundata]
Group
KEGG pathway: ptr:451491 | 3e-43
    K06104 (AMOTL1, JEAP) maps-> Tight junction
InterPro domain: [43-819] IPR009114 | 7.6e-55 | Angiomotin
Orthology group: MCL16818 | Insect specific
Nucleotide sequence:

>DPOGS208937-TA
ATGGGGTCCATGCGACCTAATGGACGTTTCCTTTCATTTGCTAATAACATACAGAGAACTCAAAAGCAACCTGTCCCAAGTGGATTTCCTCAAAGTCTATCCGGTAGTGAGACAGATGTGTCCACATCAAATGAGAATCTGTCAAGAGAGGAGAGGTATGTTGTGAGGCACACAGCACGAGTTGAACCACAAGGACAGGAAAATCAAAGTCAGACTAACAATAATAATAACAACAATAGAAACACATTGAAGGATAATGTAGGGGGCAGTAACCGCAACTCACTAAAAGACTCAGTAGGTGGAGGTAACAGCAATCGGAGTTCGCTGGATGTTTCATCATCATCATATAACACTCTGATCATTCATAACCAGGACGACTCCTGGTCTTCAAGACCAACACCAATTAGGGAACATGAAAGAACAAACAGTGAAGTTAAGCATTCAAATACACAGTCTTCCCCATACCACACTTTAAAAAAGAGTGATGCAGTGAAGAAGCCTAGCGGGATTCCGCTACCCAAAGTTCACAAAGAGCAGACTGTTACAGCGAGTGCTAATTATATTGATATTGGAGGTCAGAGGATATATACAAGCCCACCGGATCAAGGGGTTCAGGAAATAAATGAAATACCGGATGATTTTCTGAATCAGTCATCAGTTCTGAAACATCTTGCTAAGGAAGTAACCCAATCTCCGACACCTCGAGGGCTCACACCTCCAGCGTCTCCCCACTCGACTCGAGCTCCCTCGAAACCCCGTGAAGAGAGGAAAGGAAAAGGATCGAAAGCTAAACTCAGTAAGGAGAAGTTGAATTTGTCAAGATCACAGCCCGATCTAACAAGTGTTGGCGTCCGAGCAGTACCAGGTGGATCAGAGTCCAGCGGTTGGTGTAGTGGAGGGGAGGGTTCTTTGGAGGAGGCTGATGACGCGTTTGCAGCTGTTCTGGACGCTCTTGCAGCTGAGAACCACGCTCTAAAGAGACAGCTGGCTGACGCGTGCGAGCGAGTCGCTAAGACACATAAGTTGGAGCAGGAGGTGGAAAAGGTTCGTACTGCCCACGAGGAGCTCGTGGGCTCGTGCGAGCGACGGGAGCGGCTGGAGAGAGCCGCTCGGGTCAGGCTGCAAGCTGACTGTAGACGCCTACACGAGATCAACAGGGCTCTCAAACACCAGACGGAGTTACTGTCATCTGGAGGTCGAGCGGAGGGCGGCGCTAGTGTGGAGGCTCTGCGGAAAGAACTACAAGGACGGGAGATGCTCATAGCACAACTCATTACACAGAATAAGGAGTTGGCTTGCGCTAAAGAGCGTCAAGAGATAGAGATGTCAGCTCAGCGGGCGACTCTACAGGAACAGAGGACACACATCGACATACTGGACACGGCGCTGACTAACGCTCAGGCTAACGTGGTCAGGCTGGAGGACGAGTGTCGTCACGCGAGTGGGTACGTGGAGCGCGTGCTGGGTCTGCAGAGGGCGCTGGCGTCGCTGCAGCAGGCCTCGGACAGGAGAGAACACACGGAGAGGAAACTCAGGGCGCAGCTCGAGACAGAACTACAGGCTCTCAGGAAACGTGAGTGTGTGTGTGGCGGTGTGGATACCTCCGGTGTGAGTGGTGGTGGGGGCGGCGGGGGAGGGGGCGCCGCGTGTGGGGGGGAAGCGGGGGCGGAGGCCGAGCTCAGGCGGGCGCTGCGGTCGAGGGACGAGAGGCTGCTGGCTCTAGAGGGGGAGTGCGCCAAGTGGGAACAGCGCTACCTCGAGGAGGCCGCACTCAGACAGGCGGCGGTGTCCGCAGCATCCATACCCAAGGACGCTAAGATCGCGGCCCTGGAGAAGACGTCGGCGGAGTCCGAGCGACTGATGGCAGAGGCTCGCAGCGAGAAGATACGGCACATGGACGAGCTGCACTCTGCACAGAAGAAGGTCGCCGACCTGGAGAGCAGGCGGGCGCTGCGGTCGAGGGACGAGAGGCTGCTGGCTCTAGAGGGGGAGTGCGCCAA
GTGGGAACAGCGCTACCTCGAGGAGGCCGCACTCAGACAGGCGGCGGTGTCCGCAGCATCCATACCCAAGGACGCTAAGATCGCGGCCCTGGAGAAGACGTCGGCGGAGTCCGAGCGACTGATGGCGGAGGCTCGCAGCGAGAAGATACGGCACATGGACGAGCTGCACTCCGCACAGAAGAAGGTCGCCGACCTGGAGAGCAGGGTTAAAGAGCTAGAGTCCAAGGTGGCGGAGCGTGATGCGATGATCAAAGTGCTGCAAAAGCACACGAGCGCCGCCTCGCTCAGAAACAACTCGAGTCGAGAGGAACTCGTGGGTCTGTCGTCGGGAGCGTCCTTCTCCAGCGCGGAGGGCGTGGGCTCTGCCGGCGTCACCAACCGCTACAGACACCTCGCTAGGAGGAACTACTCCCCTCACAACGACAACAGCGGCTGCGGGTTCGACAGTTCGTCTCTTCGCCTGGAGGAGCAGTTGGCCGCGCTGGAGTCCCGCCTGGACCGGCCGCCGGTGCCCGCTGTGAGTTACCTCTCAGACGGCCCCCCCTCCCCACACTCCCCGCACTCCCCCCACCCTCGACCACACGCCAGCTACTACTAA

Protein sequence:

>DPOGS208937-PA
MGSMRPNGRFLSFANNIQRTQKQPVPSGFPQSLSGSETDVSTSNENLSREERYVVRHTARVEPQGQENQSQTNNNNNNNRNTLKDNVGGSNRNSLKDSVGGGNSNRSSLDVSSSSYNTLIIHNQDDSWSSRPTPIREHERTNSEVKHSNTQSSPYHTLKKSDAVKKPSGIPLPKVHKEQTVTASANYIDIGGQRIYTSPPDQGVQEINEIPDDFLNQSSVLKHLAKEVTQSPTPRGLTPPASPHSTRAPSKPREERKGKGSKAKLSKEKLNLSRSQPDLTSVGVRAVPGGSESSGWCSGGEGSLEEADDAFAAVLDALAAENHALKRQLADACERVAKTHKLEQEVEKVRTAHEELVGSCERRERLERAARVRLQADCRRLHEINRALKHQTELLSSGGRAEGGASVEALRKELQGREMLIAQLITQNKELACAKERQEIEMSAQRATLQEQRTHIDILDTALTNAQANVVRLEDECRHASGYVERVLGLQRALASLQQASDRREHTERKLRAQLETELQALRKRECVCGGVDTSGVSGGGGGGGGGAACGGEAGAEAELRRALRSRDERLLALEGECAKWEQRYLEEAALRQAAVSAASIPKDAKIAALEKTSAESERLMAEARSEKIRHMDELHSAQKKVADLESRRALRSRDERLLALEGECAKWEQRYLEEAALRQAAVSAASIPKDAKIAALEKTSAESERLMAEARSEKIRHMDELHSAQKKVADLESRVKELESKVAERDAMIKVLQKHTSAASLRNNSSREELVGLSSGASFSSAEGVGSAGVTNRYRHLARRNYSPHNDNSGCGFDSSSLRLEEQLAALESRLDRPPVPAVSYLSDGPPSPHSPHSPHPRPHASYY-
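
The lengths reported in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: the helper names `cds_length_bp` and `span_bp` are hypothetical, and it assumes that the transcript length counts one stop codon and that the genomic coordinates are 1-based and inclusive, neither of which the record states.

```python
# Values copied from the record above.
TRANSCRIPT_BP = 2598
PROTEIN_AA = 865
GENOMIC_START, GENOMIC_END = 120712, 129354


def cds_length_bp(protein_aa: int, include_stop: bool = True) -> int:
    """Coding-sequence length implied by a protein length (3 nt per codon)."""
    codons = protein_aa + (1 if include_stop else 0)
    return codons * 3


def span_bp(start: int, end: int) -> int:
    """Length of an interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


expected = cds_length_bp(PROTEIN_AA)
print(expected, expected == TRANSCRIPT_BP)   # 2598 True
print(span_bp(GENOMIC_START, GENOMIC_END))   # 8643
```

Under these assumptions the 865 aa protein plus a stop codon accounts for all 2598 bp of the transcript. The genomic span (8643 bp) is longer than the transcript, which would be consistent with intronic sequence, although the record does not give the exon structure.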