Monarch geneset OGS2.0

DPOGS202957
TranscriptDPOGS202957-TA5010 bp
ProteinDPOGS202957-PA1669 aa
Genomic positionDPSCF300195 + 377994-401408
RNAseq coverage36x (Rank: top 74%)
Annotation
HeliconiusHMEL0028410.084.36% 
BombyxBGIBMGA005747-TA0.076.60% 
DrosophilaDscam3-PB0.042.40% 
EBI UniRef50UniRef50_D7GY920.045.17%Putative uncharacterized protein n=2 Tax=Tribolium castaneum RepID=D7GY92_TRICA
NCBI RefSeqXP_968319.20.046.68%PREDICTED: similar to CG31190 CG31190-PC [Tribolium castaneum]
NCBI nr blastpgi|1892421220.046.68%PREDICTED: similar to CG31190 CG31190-PC [Tribolium castaneum]
NCBI nr blastxgi|1892421220.046.68%PREDICTED: similar to CG31190 CG31190-PC [Tribolium castaneum]
Group
Gene OntologyGO:00055152.9e-13protein binding
KEGG pathway 
InterPro domain[1207-1332] IPR0089572.9e-26Fibronectin type III domain
[928-1026] IPR0137832.1e-23Immunoglobulin-like fold
[236-311] IPR0035983.5e-16Immunoglobulin subtype 2
[131-218] IPR0130986.2e-15Immunoglobulin I-set
[1228-1306] IPR0039612.9e-13Fibronectin, type III
[230-322] IPR0035993.8e-10Immunoglobulin subtype
Orthology groupMCL10022 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS202957-TA
ATGACGTCGGATGCGTTGGCTATCAGACGAGCTGAACTGACGGACGACGGCCGCTGGGCGTGTCGGGCAGCGAACGCTCATGGCCATGTGACGCTGCGACTCCATCTGTCGCTACGAGCGCATCTCACCATACACGCCCAACCCCATACACAATTTACTATTTTGAAAGGTGACAAAGAATTTGCCAAGAGATGGTATCGTGAAACGGGAGGTGGAGTACTTCGTGAGATACGTGAGGGTGAGGGCGGGGGACGGTACAGCATGACGTCGGATGCGTTGGCTATCAGACGAGCTGAACTGACGGACGACGGCCGCTGGGCGTGTCGGGCAGCGAACGCTCATGGCCATGTGACGCTGCGACTCCATCTGTCGCTACGAGCGCATCTCACCATACACGCCCAACCCCATACACAAGTAGTAAATAGTGGAGGTACGGCTATCATAAACTGCACATGGGCTGGCTGGCCAACACCCCGCTTGGAGTGGCTTCACAATGGCATACCTCTCTCGGCTGGCGTCGCTGGAGGTAGAGTGCGAATACAGAATGGAGGAGAACAACTAGTTATAACATCCGTCCATCGAGCTGATAAAGGAGTCTATCAATGTGTAGCCAGGAACGAGAGAGATTCAGCTCAAGCCAGCGCCGAGTTGAGGTTAGGAGACACGGCGCCCGAGTTACAATACACGTTTATCGAGCAAGTGCTGCGACCCGGTCAAGTTGTAACCCTCAAGTGCTCGGCGGCTGGTTCCCCTCCACCACATTTCAGCTGGTTACTGGACGGGCAACCTCTCAATACCATGGCTCGGGGACATAGGTACAGTATTGAGCAGTTCGCAACTAAAAGTAACGAAGTGGTGAGCTACTTAAACATCACCGCTGTGCGCTCTGAAGACGGTGGACTTTACACTTGTAGAGCTGCAAACTCGCTGGGGGAGATAGCGCATACATCCAGACTTAATATTTATGGTCCACCGTACGTACGATCCATAGGCCCGATCAGAGCCGTCGCTGGCAGGGAATTGGTACTGTACTGCCCGTACTCTGGTTATCCAATTAGCTCAGTAAGATGGGAACGCGACGGAAGTCAGCTGGAGTGGGAGGGCAGTACGGGTGGAGAAGGGGCTCTGAGGATATCACGCGTTGAAGCGAGCAGCGCTGGGGCATACACTTGTAGTGCTGTTGGACCACATGGGGAAATTGCGAGACGAGAGCTGCAGCTTATTGTCAGCAATCCTCCCGAAATCGAGCCGTTCTCCTTCTCCGCCAACTTGCAAGAAGGCAAGCGAGCCCAAGTTAGTTGTAGCGTGGCATCCGGCGACATGCCTGTGCGATTCGCCTGGCTCAAGGACGATCTGCCCATACCAGCTGACCTACAGGTGGAAGAACGTGGGGCAGATTTTTTCAGCAACCTCGTTTTCAAAGAGGTTTCAGCTCGTCACAGCGGTCGGTACACATGTGTTGCGTCCAACAGCGCGGCCAAAGTTAACTATACCGCGGAGTTGGTGGTCAAAGTTTCTCCCAAATGGCTCCGCGAGCCCATGGACGCAGCTGTATTAGCTGGAGAGCAGCTAGCGTTGCACTGCCACACGACAGGATCACCCGCACCACATACTACGTGGCTTAAACAAAGAGCTGGTTCTGCATCAGACTTTGTTCCAATCATCAACCTTGGAGGGAGGTTTCAGTTTCTCTCGAACGGCACTCTCTGGATTGAAGCAGCCTTGCCGTACGATGAGGGTTACTATATGTGCAAAGCTGAAAACGGAGTTGGAACTCCGCTGTCCAAAACTATATTTGTGGCTATTAATGAGCCCGCACGTTTTGAGATGACGTCTCATAACGTGTCCGCAAGACGGGAGGCGGGCGCGACGCTCGCGTGTGAGGCGCGGGGCGACGCACCGCTGAGGGTCACTTGGTCAAGGGACAACGCCCCGCTTGACCTCTCTACTTATAGATTAAGCATTTCTGAAGCAAAGACTGATAGTGGTTTAAGGTCCCAACTGTACATAAGCCGCACAGACCGACAAGACTCCGGCGTCTACAAGTGTCAGGCCGTCAATGCCTACGGTCACAGCGACCATTACATTTACTTATCCGTCCAAGAGAAACCTGAAATGCCTCAGTCATTGTCTGTGACTGAGATCCAGTCCCGTGCTGTCCGCCTGTCTTGGAGCGCGGGCTTCGATGGCAACTCACCTCTCCACGGCTACACGGTCCAGTACACGCCGCTTAGTACTCAGGCTCGAGGTGAAAGATGGGAGAATGCCGCTACACTGAACGTCACCTGGGCCGAACACGCTGGACATACACACTCTCACGTCACGCTCACCAAAAAGGGCGATATACATTACGAAGCATTATTGAGTAACTTGCGGCCGCACACAGCGTATATGATCCGCATCGCCGCTATCAACCAGATAGACCGTAGTGCTTTTACTGAACCAGTGGTTGTAAAGACACAGGAGGAAGCCCCGTCCGAAGCCCCCAGCGGCGTGTCAGTCTCAGCTGGGGCGGCAGGTGAACTGCACGTGTCGTGGCGCGCTCCTCCCCGGGACGCATGGCACGGAGAGTTACTAGGGTACTCTGTGACTTGCGCCGAATTAGGTCCAGACTCTGCTCCGATACCCAATGCGACTAGAACTCTCACAGTTAACGGTTGGTCGGCAAGTGAACTGACCCTTTCAGCACTCAAGAAATTCACTCGTTATGAAGTACGCGTCCGTGCTTTCAACGGCATCGCGGCTGGACCTCCGTCTGTACCCGTCACAGCTACAACATTAGAAGGAGTGCCTGAGGCGGCCCCGAGCCACGTTTCGTGTTCCGCCCTATCATCCTCAAGCCTTAAAATATCCTGGAACGCGCCACCGCCAGCGTTACGAGGCGGAATAGTTCAAGGATACAAGATTATCTACGCCCCGCTTTCTATCACGCACTCTGAGGGTGCTGAAATGAAGCGTGTGTCTACTACGGAGACTTACCTGCATACACTACACAAGTATAGCAACTACTCAGTGCAGGTGGTGGCGTACACAGCGGCGGGAGACGGCAAGAGGAGCGCTCCAGTGTATTGCATGACGGAAGAAGATGTGCCCTCGGCTCCGGAAAAGATCAAGGCGTTACCATATTCAAGCGACTCGGTGCTGGTAAGCTGGCTGCCACCCTTGCACCCCAACGGTATCATATCACACTATACGGTTTACTACAGGGAGGCTGGACGGTTAGGCAAGCACTCAACCTTCACAGTGTCAGCTGATAAGTCTCCCGAGATGGAATTGATGTTTCAAGTTAGGAATCTGATGGAAAACCAGTTGTATGAGTTCTGGGTGTCAGCGACTACTGGCTCGGGAGAAGGGGAGTCTACTCTGGTAGTAGGCCAAGGTCCCAGTTCCAGAATTCCAGCTCGCATAGCCTCTTTTGGTGGTACGATCCGCGTTGGTCCCGGTCGCGGCGCCTTACTGGCGTGCTTGGCTGTAGGGGTCCCTCCACCTAGAACGCGGTGGGCGCACGCTCGAGCCCCCGTTACACATCACCGTTACTATCAAGTTACCAGAGCTGGACATTTACATATTCGTGAGGTCAACTCCGAATCATCTGGGAATTTCACCTGCACGGCCACAAACAGTATCGGCGAAGACTCCATCGTGTACTCGGTGAGAGCAGAGAAGCCGCCATCCGCGCCCGCACTCTCCTTACAGTATACCACCGCTACCAGCATCAAGCTGCATTGGCGACTAGTAGACAACGAGCAGCCGGTCTTAGGTTACATTCTACATTATAAACAAGCCTCTGAAACAGAGTGGACCAGCGTCGAACTTTCACCAGAACAAACTTCCTACACCATGGATATGTTGAGATGTGGAACTACTTACAACGCTAAAATACAAGCCCAGAATAAAATATCTCTGGGACCACCGAGTGAGATACTGACTGCTACAACCAGAGGGGGACGCCCAAAACCGCCCAAGCCAGAAGAGCACGTGCATACAAATGCGACGGCCATCAAGATTAATCTTTACGCCTGGAGGGATGCCGGTTGCCCCATATTGGGCTTTAGAGTCGCTTACCGACGAGCAGGGGACGAGCATTGGATACAGGCGGGTTCTGATTTGAATGCGGCTCGCCACGTAGTAGGAGCCTTGTCTCCAGGCGCGTGGTACGAGCTGGCTGTGGAAGCCTGGAGCGATGCGGGCAGTGAGCGCGTCACGTTACTGGCTGATACTCATACACTTGCTGGAGGTCGTATCCCTCCCCTGCGTGTTTCCCCTCCAGCAGGCGGTGCTCGTACCAGCATCATGAGAACAGCTCTCGCGTGGTGTGCCGCTAGTGCATTACTGCTCGTCGCCACGCTAGCAGCTCTGGTCTGCTTTTTACACGCCAAACGCAAATTCTTCTGTTTCTCTAACGACCACTATTTGAGGGATAACAGGAAACTAAGCGAAAGCAATGAGGCCGAGCGAGAAAAACTACGAGAAGGTCATAAATTATATTCATCTTCGTCAATTAACGGGAACGAAAAGTTGAATGACGATTCATCAGCGGAATTGTACGAGATAAGTCCTTATGCGACATTCGGCGGCGCCGCGGCACACAGTCTGCAGTTCCGTACACTGGCGCGACGCGAGGACGACGCAGCGCCGCCGCATCGACGACGCAGGCGAGCGTGCGATCACTATAGATATGACGAGTCAAGTCTCTCCAAGTGTTCTACTGTGGAGGCGCGTCACCGCCTCCGCGCTGCGCCTGCGCCGGCGCCGCCACATTGGCGAGAGAGATCCGACTCTGACGACTATAGCGACAGCGCGGCCAACACCACTACTAAAGGTTCCGGCGGGTCGGGCGGTGGCAGTGCGTACGGCAGCGGACGGCGCACCGCCAGCAGCGGTGGCCAGAACGGAAGTGACGGCCTGTCAGGGTCCTTCGTGCCGGTGCCGCCAGATATATCATCGTTAATAGACAAGTACCAGCAGAGAAAAGAACAGGAACGTCGGGAATGCACGATACATGTCTAA

Protein sequence:

>DPOGS202957-PA
MTSDALAIRRAELTDDGRWACRAANAHGHVTLRLHLSLRAHLTIHAQPHTQFTILKGDKEFAKRWYRETGGGVLREIREGEGGGRYSMTSDALAIRRAELTDDGRWACRAANAHGHVTLRLHLSLRAHLTIHAQPHTQVVNSGGTAIINCTWAGWPTPRLEWLHNGIPLSAGVAGGRVRIQNGGEQLVITSVHRADKGVYQCVARNERDSAQASAELRLGDTAPELQYTFIEQVLRPGQVVTLKCSAAGSPPPHFSWLLDGQPLNTMARGHRYSIEQFATKSNEVVSYLNITAVRSEDGGLYTCRAANSLGEIAHTSRLNIYGPPYVRSIGPIRAVAGRELVLYCPYSGYPISSVRWERDGSQLEWEGSTGGEGALRISRVEASSAGAYTCSAVGPHGEIARRELQLIVSNPPEIEPFSFSANLQEGKRAQVSCSVASGDMPVRFAWLKDDLPIPADLQVEERGADFFSNLVFKEVSARHSGRYTCVASNSAAKVNYTAELVVKVSPKWLREPMDAAVLAGEQLALHCHTTGSPAPHTTWLKQRAGSASDFVPIINLGGRFQFLSNGTLWIEAALPYDEGYYMCKAENGVGTPLSKTIFVAINEPARFEMTSHNVSARREAGATLACEARGDAPLRVTWSRDNAPLDLSTYRLSISEAKTDSGLRSQLYISRTDRQDSGVYKCQAVNAYGHSDHYIYLSVQEKPEMPQSLSVTEIQSRAVRLSWSAGFDGNSPLHGYTVQYTPLSTQARGERWENAATLNVTWAEHAGHTHSHVTLTKKGDIHYEALLSNLRPHTAYMIRIAAINQIDRSAFTEPVVVKTQEEAPSEAPSGVSVSAGAAGELHVSWRAPPRDAWHGELLGYSVTCAELGPDSAPIPNATRTLTVNGWSASELTLSALKKFTRYEVRVRAFNGIAAGPPSVPVTATTLEGVPEAAPSHVSCSALSSSSLKISWNAPPPALRGGIVQGYKIIYAPLSITHSEGAEMKRVSTTETYLHTLHKYSNYSVQVVAYTAAGDGKRSAPVYCMTEEDVPSAPEKIKALPYSSDSVLVSWLPPLHPNGIISHYTVYYREAGRLGKHSTFTVSADKSPEMELMFQVRNLMENQLYEFWVSATTGSGEGESTLVVGQGPSSRIPARIASFGGTIRVGPGRGALLACLAVGVPPPRTRWAHARAPVTHHRYYQVTRAGHLHIREVNSESSGNFTCTATNSIGEDSIVYSVRAEKPPSAPALSLQYTTATSIKLHWRLVDNEQPVLGYILHYKQASETEWTSVELSPEQTSYTMDMLRCGTTYNAKIQAQNKISLGPPSEILTATTRGGRPKPPKPEEHVHTNATAIKINLYAWRDAGCPILGFRVAYRRAGDEHWIQAGSDLNAARHVVGALSPGAWYELAVEAWSDAGSERVTLLADTHTLAGGRIPPLRVSPPAGGARTSIMRTALAWCAASALLLVATLAALVCFLHAKRKFFCFSNDHYLRDNRKLSESNEAEREKLREGHKLYSSSSINGNEKLNDDSSAELYEISPYATFGGAAAHSLQFRTLARREDDAAPPHRRRRRACDHYRYDESSLSKCSTVEARHRLRAAPAPAPPHWRERSDSDDYSDSAANTTTKGSGGSGGGSAYGSGRRTASSGGQNGSDGLSGSFVPVPPDISSLIDKYQQRKEQERRECTIHV-