Monarch geneset OGS2.0

DPOGS213854
TranscriptDPOGS213854-TA3252 bp
ProteinDPOGS213854-PA1083 aa
Genomic positionDPSCF300361 - 81804-93045
RNAseq coverage593x (Rank: top 21%)
Annotation
HeliconiusHMEL0103040.082.15% 
BombyxBGIBMGA009661-TA0.072.30% 
Drosophilavlc-PD3e-3248.55% 
EBI UniRef50UniRef50_D6WRU72e-3240.95%Putative uncharacterized protein n=2 Tax=Tribolium castaneum RepID=D6WRU7_TRICA
NCBI RefSeqXP_974796.23e-3339.67%PREDICTED: similar to vlc [Tribolium castaneum]
NCBI nr blastpgi|1892387916e-3239.67%PREDICTED: similar to vlc [Tribolium castaneum]
NCBI nr blastxgi|1954026252e-5132.40%vlc [Drosophila virilis]
Group
Gene OntologyGO:00072671.1e-13cell-cell signaling
KEGG pathway 
InterPro domain[835-992] IPR0050261.1e-13Guanylate-kinase-associated protein
Orthology groupMCL21932 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS213854-TA
ATGTCTGAAAAAGTCATTGCATTCATTAGTGCACCACACTCATTCGATTTACCTGATGAAGAAGATAAGAAATCGCCCGTTCTTTTGTTAGGAGTGAACAAGTTATCGGATTCTCCATTTTTCCTGAGAAGAAGACTCACAGACCCTCACTCATCAGGATCAATTAGATTCTTCCGCCCGCCAAAATCAAAGTATACGAGGGGTCTAAATGGCGATGAGAGATTGGAAAAGATTAGACAAGATCTTCTGGTTCATAAGAACAAACCAGACACCTACATTCCAAAACCTGAGGTCTCAATAGAACCGGAAACTAAAGGGCCTTGTATTAAACTAGGCTGCGTCGGTAACTGCATCTGCGACAGGAAGAAATCTTCGTCCAACAGAGGCTCTTTTTCATTTCATAGGCCAGAAAACAGACAGCCTTCTCTATGTCCATCAGAACCCTATGGTAGAGCTGAAAGTAGAGACAACACAGAAAACGACATAGAGTTCAAATATACGGAACACATTCCAGATGTACAACCCTCAAGCAGCAAGCAAAACTCATACAACGTGACACCGAAGTCCACTGAACGAAGACACATACTACTTTCAAACTTGAATGAGAGATTCCGCAAAACCCTCAGCTTAGATTCAAGGAAGATCGAAACAAATCGAGTGCTGAGCCTGCCACAGGAACCTAGACCAAAATCTCTAGTTAATACAGATAACATTTGCGTGCAATACTCTTCGAAAGATCCCTTCATTCCGGCTAATTTAGTAAAAACCAGCCAGCCCAAGAAGAAACGCAAATCGTTCTTTTCTTTAGACACATTCTTTGATTCAAAAAGGTCAGACACGTCCTCCATAGATGACTATTGTTCCTCGAAATATAAGCGTTTTGAACTAGAAAGTGATGACTCTCCATTGTTCAGACGTGAAAGACCAAGAGAAAGGACCCCAGATCTTCTTAACCTACCTGTAATCATAGATTCCGTGCGAAAAAAGCAATCAGACACAAAGAGGCAATTGGAGAAATTAGAAAACCATTTTTATAAGAGCCTGGACAAACGCACAGTTATTGACGCCGTACCAGCCGATGCGACAGAGGGACCGAGCGGTATCAACCATCTGTACGACGAGGAAGTAGAGTATATAGAACCGATACGAAGCACCTTCACCAGCCAGAGCACCTTGATACTAGACAAAAACACAGACACGCTGAAACAAGACTCCATACTGACGATAAACACGGAGGAAATTGACATAAAACTACCAAGTTCAACAGTGTCGCCAGCCTCAAACAAAATAGAAAATGACATAGACAGTATCGGGGCTTACGAAATAGAAGTCAAGGAGTGCGATCTGAAGTGCACTAAGAACTTGGTTGTCAAAGTTATGGACAAATCTGTTGACTCGATTGGAAGTTGTTCCTTAGATGTCGATGCTAGTACGGACTTCTCAGATACTACATCCGGAAGCTTGAATCTCCTTACGCCTTCGTCGACGACCAGTAGGATACGGGATTTCACTTCCAGGATACAGGAACGAGCCTCCAACCTCCACACAATATCACCGCAGGCTCCAATATCACCAGCCAGGACACCAGACACAACCCCAACACCTCGGAAGACTGATCACCTCCTGAAACCTCCGCCGAAGATATACATTGACACAGCGAGTCACAGGTCGAGGAGTCACTTGAAGAAACGGGACGAACTACCGCAGAAGAAGCCAAGCTACCTGAACCTAGCGTGTTCAGTGAACGGGTACACGAACCTGACAACGTACGATTCCAAATTGCGTCAGGACATCAACAAGAGCCGAGAAGCTTCCCCAATAAGGCCCATAACACACACGTACCAGTATAAAAGTGAGAGCAGCTCACTGTTAGTGCCGATCCCAGTGAGCGCTAACAAACTGTTGGTACCGAAGTTCGGTCCTAATGATACTCGCACTGATTTGACGCCCAAAGCGCCTTCCAAGGCGCTCACCGACCCACACATAGCATCCCCATTACATGCCTATATGGCAGCAGAAAATAAACAGCTGAAGAACGACTTCTTAGGGCAAAGCATGACGACTACCAGTCGTCAGTTTATATCGAATGAGGGCAAGAACTTTGCTGCGTCTATGTTACACCAGAAGGATGAGGTGGATAACGTTAAGGAAATAACATTCAAGTCCAGTTACTCGGAGACGAACTTCAGACAGACGGTCAGCAATGGCAAAGAGAGCAGGTTCTCCTCTGAGTCGTATACGATATCTTCGAATGGTGTCTCCAAACGGGTCGAGATCACCAAAGAAAACGGCGAGAAGCTGACGAGTCCCATGAAGAGCTTCATACAACAGCGCGTCGAACGCCTGTACGGGCCGGGAGCGCTCGCTCAAGGATTCTTCAATCAGAAAAGGCACAAGCTGAAGAGTACAAGCGACGATGAAGACTCGAAGGTGTTGACGGAGAAGTCATTAAACTGCCCCAGCGAGAGATTCGTGTCACCGCGCAAAACGAACGAAAGCTTCGACAGTGAGAACATATGCACCAGTCCCACGAATGATACAACAGTGCTGCCTGTACTCAGGCATCTCAGGCCCGAATTTCGAGCTCAACTGCCAGTGCTCTCACCGCGGAAGAGCTTGAAGTCCGATCTCTCGCCTCAGAAGCTGGAACAAGACATTCCGGAAGCGAAAAAGACTGAATTGGTCGAAACCAGCGAAGTGACCAATGGCCTATCTGTGATAGATTTAAACAAACCAGTGAAAGAAAATTGTGAAAGTGAAAAAGTTAACGGTGATGTGGTTAAAGATGGACATTACTTCTTGGATTTGGAGAAGAAGGAAACTGAAAGGTTGATTGCTCTGGCCGTGGGCGCTGAAAAGGAGTTGGAGCACTTGCAGAATGTTGACAATGTAAGTGAAGAAGTGCTGGGCTTCCTCCGAGCTGCCTCTGGCAAGGCGAGGCTGTTGGCCACACAGAAGATGCAGCAGTTTGAAGGTGCACCCACAAATCTCCCCCTTGAGACAAGCTACACCCAACTCAGCGAAATAATAGACTTCTCTGAAGTCAGGCTCCGACTATCATTAGAGCGTGCAGCTCGTGAACGACAGCTGGCAGCTCGGGCTGGACCCTGTGCTGGAGAGAGCTCGCTCGGACAGAACGGAGAAGAGAATAAGGAGGTAGAAATATTTGTAGGCAAAAATTCAAAGTCGAAGCAGATACAACAAGGTGACTGTCTCGGTTCGTCATCAGAAAACTCATTTACTCAGAGCTGA

Protein sequence:

>DPOGS213854-PA
MSEKVIAFISAPHSFDLPDEEDKKSPVLLLGVNKLSDSPFFLRRRLTDPHSSGSIRFFRPPKSKYTRGLNGDERLEKIRQDLLVHKNKPDTYIPKPEVSIEPETKGPCIKLGCVGNCICDRKKSSSNRGSFSFHRPENRQPSLCPSEPYGRAESRDNTENDIEFKYTEHIPDVQPSSSKQNSYNVTPKSTERRHILLSNLNERFRKTLSLDSRKIETNRVLSLPQEPRPKSLVNTDNICVQYSSKDPFIPANLVKTSQPKKKRKSFFSLDTFFDSKRSDTSSIDDYCSSKYKRFELESDDSPLFRRERPRERTPDLLNLPVIIDSVRKKQSDTKRQLEKLENHFYKSLDKRTVIDAVPADATEGPSGINHLYDEEVEYIEPIRSTFTSQSTLILDKNTDTLKQDSILTINTEEIDIKLPSSTVSPASNKIENDIDSIGAYEIEVKECDLKCTKNLVVKVMDKSVDSIGSCSLDVDASTDFSDTTSGSLNLLTPSSTTSRIRDFTSRIQERASNLHTISPQAPISPARTPDTTPTPRKTDHLLKPPPKIYIDTASHRSRSHLKKRDELPQKKPSYLNLACSVNGYTNLTTYDSKLRQDINKSREASPIRPITHTYQYKSESSSLLVPIPVSANKLLVPKFGPNDTRTDLTPKAPSKALTDPHIASPLHAYMAAENKQLKNDFLGQSMTTTSRQFISNEGKNFAASMLHQKDEVDNVKEITFKSSYSETNFRQTVSNGKESRFSSESYTISSNGVSKRVEITKENGEKLTSPMKSFIQQRVERLYGPGALAQGFFNQKRHKLKSTSDDEDSKVLTEKSLNCPSERFVSPRKTNESFDSENICTSPTNDTTVLPVLRHLRPEFRAQLPVLSPRKSLKSDLSPQKLEQDIPEAKKTELVETSEVTNGLSVIDLNKPVKENCESEKVNGDVVKDGHYFLDLEKKETERLIALAVGAEKELEHLQNVDNVSEEVLGFLRAASGKARLLATQKMQQFEGAPTNLPLETSYTQLSEIIDFSEVRLRLSLERAARERQLAARAGPCAGESSLGQNGEENKEVEIFVGKNSKSKQIQQGDCLGSSSENSFTQS-