Monarch geneset OGS2.0

DPOGS210888
TranscriptDPOGS210888-TA2721 bp
ProteinDPOGS210888-PA906 aa
Genomic positionDPSCF300045 - 857527-868475
RNAseq coverage123x (Rank: top 57%)
Annotation
HeliconiusHMEL0039530.083.30% 
BombyxBGIBMGA003766-TA1e-15479.11% 
Drosophilaab-PA4e-6743.45% 
EBI UniRef50UniRef50_E0VF313e-12749.49%Protein abrupt, putative n=6 Tax=Neoptera RepID=E0VF31_PEDHC
NCBI RefSeqXP_974947.21e-14251.06%PREDICTED: similar to abrupt CG4807-PA [Tribolium castaneum]
NCBI nr blastpgi|1892378532e-14151.06%PREDICTED: similar to abrupt CG4807-PA [Tribolium castaneum]
NCBI nr blastxgi|1892378536e-14851.31%PREDICTED: similar to abrupt CG4807-PA [Tribolium castaneum]
Group
Gene OntologyGO:00055151.2e-24protein binding
KEGG pathwaydme:Dmel_CG114917e-35 
 K02174 (BR-C)maps-> Dorso-ventral axis formation
InterPro domain[257-371] IPR0113338e-31BTB/POZ fold
[276-370] IPR0130691.2e-24BTB/POZ
[285-380] IPR0002101.4e-20BTB/POZ-like
Orthology groupMCL14746 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210888-TA
ATGTCGAGTCTACCACGAGGCGTGAGGAGTGAGCTGGTGTCCGAGATCCAGGAGCCAGCTGGTCCCGGCCTGGCCGCGCTCGTGTCCGCGGACAGGCCGGCCGCTGTGGGCGTGCAGGCTGCACCGCCCCGCCGCCGCTGGCTCTCCTCTCCTCTCCTCTCCGCTCCTCTCCTCAGCTGTCCTCCTTCCCCCCGGCCCCGGACCGTGCCCCCCGCAGCCCGGACCCCGCGCCCCGGTCCCCCCTGGTTATTCTCGGACCGGTGGCGGCAGGCGGGGCGACAGTCGCCCGCGTGTTTTGTAAAGACGTGTGTCGGGTGTCGAGCGACGGTCGCGGAAAATGCATTTTCCGGGTGCAAGGCGCAGGCGGTGCGGTGCGGCGCGGTGTTGGGGTCGCCGGGGGTGTGCGAGGGGTGCGCGGGGTGCGGGGCCGAGGGTGAGGAGAGGGGGGTGTCGAGAAGGGTGCAAGCAATGATGGACATCGTATTGATTTCTATCTGGCCGCTCCTCCCTTATTGTGGTGGTGCAGGGCGCGCGCCATCGATCGTCGTTGCACCGAGTCTACATTCCCCCCGGGGTGTCCCGAGTCCGAGCTGTCCGGACCTGATCTTCGGCGACCCCAAGCCGACTGATAAAGTGACAGTCGCTAAGGTGGCGTCGATAACGACCCGCGTGGAATGCGCTGGCGCGGAGTATAGCGCGCGTCCACGAGCAGACAGCGCCGTGCCGCCCTCCGAGCACGCCATGGCCACCGACACCGGCGGCGCGGGCGAGCAACAATACTCGCTAAGATGGAACGACTTCCACTCGGCGATGGTGTCGTCGTTCCGGCGTTTAAGAGACGAGGAAGACTTTGTTGATGTCACACTGGCCTGCGCCGGGGCTACATTCACCGCACACAAGGTGGTATTGTCAGCCTGCAGTCCATATTTTAGAAGATTATTAAAGGCGAATCCCTGTCAGCATCCGATCGTTATTCTAAGGGATGTTCACGATAAAGACATGGAAAGTTTACTAAGGTTTATGTACCAGGGGGAAGTTCACATTGGACAAGAGCAACTAAAAGAGTTCCTAAGGGCCGCCCAGCTTTTGCAGGTCAGAGGCTTAACAGACGTTCCTCCTCCGGTTCCAACACTCGACCAGAAAGCGTCGCCGTCCGCGTCTTCATGGACGGAGACGGGCAGCGGAGGGTCTCGTGAGGGTCGCGAGGGTCGAGAAGGTCCTGGAACAGTCGGCCGAGAAGGTAGACGAGCCCGGCGACCAGACGACGCACCCGCCACTCCACCGCCCAAGAGAGCCCGTTCATCTGACCTGTACCAGGCACAAATGAAAACCAGGACGGAGCAACTCTTGGCCCAGGGCAGTCCCGAGCGTCTCGCTGCCAACGGACATGAGCTGGCGCTTTTCGGTCACGCGCTCGACAACGCACACCATCCTCATCTCTCCAACGTATCCAACAATCTGTCTGGTGACGGCGAGGATTCTTCTTCTGACGCAGGCGCGAGTGACGCCGAAGGTGAAAACAGAGCGAAACAGGAACCGATAGATTATGACGACCCCGCGACCATGGCCAACACTAACGGTGCTCTCCCACATAACTTTCCAGGACTCTTGAACCTACCAGGATTCCCGGGTCTTCCAGGCCCGTCGGGAATACCGGATAATTTCGGTGGATGTCGCCGTACCCAGGATCTGCTGCGCGTCAGAGCGACTGATCCTAGGCCCTGTCCAAAATGCGGAAAAATATACAGATCAGCGCACACTTTGAGAACACATCTTGAAGATAAACACACTGTGTGTCCCGGATACCGATGCGTGCTCTGCGGCACGGTGGCTAAGTCCAGGAACTCTCTACACTCTCATATGTCACGCCAACATCGCGGCATATCTACCAAGGATCTGCCCGTGTTGCAAATGCCCACACGCTTCGATCCCGAGCTGGCTAGCCAACTTCTAGCTAAGGCTGGTGTAAAGGTTTCCCCCGAACAACTCCGTGCCCGCGCATCTCCAACGGGCGCTCGTCGGTGCGACATGAAGCTGGACGCCAAGTCGGCAGCCTCCGAGAACAGCTCCCTGTGCGGAGACAACGACGACGACCTCTCGCAGTCCAAGTACCAAGACACGCTACCGCATTCACCACCGCATATAAACAGCTTGGCGAACACGACAATAACAAAAGTACCCGCGGTGCGAGCCGTTACCGCCAAAACAGTGGAGAACATGCCGCATGGACTCTCTGGCATGAAACACGACGACTACCCGCCAGGCTTCGGAGGAGGGTCAGCTATACTGGACACGTACCTTCAGTACATCGGAGAGAATGTTTTTGGAATGAACGCAGCCAAGTTACAACAGATGAACGCCATACACATGGATAGGAAGACGTACGACGAACCGTCCCCACAGATGGGCTCTCTGCCGCCACCGCCGACGACTCTAGCGCACCAGCGCTTCCTCAAACAGATGCAGCGCCAGTATACCGACAACATGGCCAAGCCGGATATCACGCGCGACCCGGACGAGCCCCTCGACCTGGGCAAGGAGCGCCAGCGAGCGGAGAGCGCCGGCGACGACAACGACAAAGACTCGCTGGACAAAGACTCGGGCGACTTCTACAGCAAGAACTACGAGGAGGAACGCCGCGGCGAGGACCAGCGCTCGGGCTCCCAGCATCAACACCAGACCGAGCGCGAGTACGCCGAGGACGACAACAAGGCCTCATGA

Protein sequence:

>DPOGS210888-PA
MSSLPRGVRSELVSEIQEPAGPGLAALVSADRPAAVGVQAAPPRRRWLSSPLLSAPLLSCPPSPRPRTVPPAARTPRPGPPWLFSDRWRQAGRQSPACFVKTCVGCRATVAENAFSGCKAQAVRCGAVLGSPGVCEGCAGCGAEGEERGVSRRVQAMMDIVLISIWPLLPYCGGAGRAPSIVVAPSLHSPRGVPSPSCPDLIFGDPKPTDKVTVAKVASITTRVECAGAEYSARPRADSAVPPSEHAMATDTGGAGEQQYSLRWNDFHSAMVSSFRRLRDEEDFVDVTLACAGATFTAHKVVLSACSPYFRRLLKANPCQHPIVILRDVHDKDMESLLRFMYQGEVHIGQEQLKEFLRAAQLLQVRGLTDVPPPVPTLDQKASPSASSWTETGSGGSREGREGREGPGTVGREGRRARRPDDAPATPPPKRARSSDLYQAQMKTRTEQLLAQGSPERLAANGHELALFGHALDNAHHPHLSNVSNNLSGDGEDSSSDAGASDAEGENRAKQEPIDYDDPATMANTNGALPHNFPGLLNLPGFPGLPGPSGIPDNFGGCRRTQDLLRVRATDPRPCPKCGKIYRSAHTLRTHLEDKHTVCPGYRCVLCGTVAKSRNSLHSHMSRQHRGISTKDLPVLQMPTRFDPELASQLLAKAGVKVSPEQLRARASPTGARRCDMKLDAKSAASENSSLCGDNDDDLSQSKYQDTLPHSPPHINSLANTTITKVPAVRAVTAKTVENMPHGLSGMKHDDYPPGFGGGSAILDTYLQYIGENVFGMNAAKLQQMNAIHMDRKTYDEPSPQMGSLPPPPTTLAHQRFLKQMQRQYTDNMAKPDITRDPDEPLDLGKERQRAESAGDDNDKDSLDKDSGDFYSKNYEEERRGEDQRSGSQHQHQTEREYAEDDNKAS-