Monarch geneset OGS2.0

DPOGS215444
TranscriptDPOGS215444-TA3915 bp
ProteinDPOGS215444-PA1304 aa
Genomic positionDPSCF300298 + 99849-111248
RNAseq coverage1359x (Rank: top 9%)
Annotation
HeliconiusHMEL0163240.067.11% 
BombyxBGIBMGA005736-TA8e-17361.81% 
DrosophilaCG31999-PA5e-7934.76% 
EBI UniRef50UniRef50_E2BUT11e-11931.88%Fibrillin-2 n=5 Tax=Formicidae RepID=E2BUT1_HARSA
NCBI RefSeqXP_001867489.13e-9734.81%fibulin 1 [Culex quinquefasciatus]
NCBI nr blastpgi|3800268582e-12327.68%PREDICTED: fibrillin-2-like [Apis florea]
NCBI nr blastxgi|3071803975e-15928.58%Fibrillin-2 [Camponotus floridanus]
Group
Gene OntologyGO:00055094.4e-12calcium ion binding
GO:00055153.9e-05protein binding
KEGG pathway 
InterPro domain[1087-1128] IPR0018814.4e-12EGF-like calcium-binding
[1087-1127] IPR0130916.7e-10EGF calcium-binding
Orthology groupMCL11117 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS215444-TA
ATGAAGTTATTAGTGATGAATATAGTGCTGTGTGTTGTCAGGTTTTCAGTCCAGGGAGCTCTTACGTCAGAGGAAATAGTGGATATAACTGAAACATGCTGCAGTTACGGGGAGATGTTCCTGATGACTTCTCCGGACAAAGATTGTTCTAAACTAGGCACACCTGAAGATATTGAACCCGAACAGATGGAAGCTTGCAAACCAGCCGCAAAAACCTGCTGTGAACAGCAAATACTAAAAATAGACGAATGCAACGCTGGCATAAAGTGGGCTGTTGCAAAGAAATGTCAGACTCCTGAAGATGAAATTGGAAAGACATGTTGCGACGAGTGTTCATTTGGTCGTCTTGCTGGGACTCAGGGTAAGCAGGCCTGTGGAGATGAACCTTCGGAATTCTTGAGCCCTTTAACAGCTTTGAGAAAGATGGCCTATCATAAATGTTGTGTGGAAGCTGCGCAGGAATTAGAGACGACGACGGAGAAAAAGAAAGTAACTACAACCGAAAAACCAAAGGAAAAATGTAAGGCGAACTCTTGTGAGCATAATTGTTCGGACAGTGACGGCAAGGTCACGTGTCTGTGTAAAGATGGTTATAGACTTCAACAAGATAAAAAATCTTGTAAAGATATAAATGAATGTGCAGAAGCCGTAGATGACCTGTGCACAGATAAGGACACTGTGTGCCACAATACTGAGGGATCATTTAAATGTGTGCCTCTTAAGAAGCGAGATGTTGGCCTAAGTTGTCCTCCAGGATTTAAACGAAATGTCGTTAACCAAGTCTGTGACGATATTAATGAATGTCGTCTTCCAAGGCCCCCGTGTCCCAAATACCTTTGTGAAAACACTATCGGTGGTTACAAATGTGCCGGTAAAGTTGGAAAGCCTTACACAGAAGATGGTACAGGACCAACAACTGAGGCCGGAGCTTCAACTTCCTCGACAGTAAGAAATGATATCTGCCCGCCGGGTTTCAGAGCCGGCCCTGACGATGAATGCCTCGATATCGACGAGTGCGAGGAACATTTGGACGACTGCCAGCGTCTGTCACAATATTGTATTAACACTCACGGAAGCTATTTCTGCCAGGACCATGTCTCCAAGCGATGCGCTCCCGGCTTCAAGGTCAATAGTAACACTGGTATATGTGAAGATATCGACGAATGCGAAGAAAGCTCAGAAGTGTGCAAGCGAAACGAAGTTTGCATTAATCTGCCAGGAGCCTACAATTGCAAGTCGAAAATTAGTACACTACCAAAGCTGGCCACACAGAATTGCCAAGAAGGTACTCGCAGAAGAGGAAGCAGTTGCGAAGATATTGACGAATGTCGGGAAGGAACGCATTTGTGCGACCAGTTTCAGAACTGCATTAATACCTTCGCCGGACATGAATGTCGCTGTAAGAACGGTTTCGAGTTAGACTCTACATCTGGATCATGTGTAGATATTGATGAGTGCGCTCTAAAGTTAGACAACTGTGGATCAGAACTGCGTTGTTTGAATGTACTGGGTTCTTTCACTTGTACACGATCAACATCAACACCACCGGCCCCAGTTTATGAATATGAATATTACGACTCCGAAGAGGACAATTCAGTAATTCCAAGTCCAGAAACTACATCATCTACAACGACTTCAACCACAACATCTACAACTACGACAACGACCACGCCAAGACCGACCACAACCAGCTCTACTACTACTTCTACCAGACCATCCACCACACCGAAACCATACCAACCCAGAAGATACCCTAACACACCAAGAAGACCATTCTATCATAGATCTTCCACTTCTACCACCACTAGCACGACTTCAACAACTCCGCCACCGGTTCCAAAATATCCAGAATGGTCGGACTATCCAAGAGAAAACACAACTCCAAAAGAAGTAACAGTTCCAAAACCAGATATAACGAATGTTATCGAAACAGACAAAGAACCAGACGGCAGCTTTGTCCTCAACACCAATGATATCCCAAAGGACAGATGGACCAATGTTATAAACAGAGAGCATGAAAGGTTCAACCCAAACTGGTTACATTGTCTTGATGGATATGAGAGGAACGAACGGGGAGAATGCGTTGACATCAATGAATGCGGAGCCAATCGACATAGTTGCAGTTCCTTAGAGTACTGTATAAATACACCAGGAAGTTATGACTGCGAGTGTATTCCTGGTTTTGTGAGGGATCCATCCGGTTGGTGCGGTGTTGCCACTACTCCCAGTACTTCTCCATCACCACCAACTCAGAGACCAACCACCCTAAGGCTAACTACTTCAAAACCAACCACAACTTCAAGACCTACCACTACTCCAAGACCTACTAGACCACCTAGAATACCTGCGGCTAGACCCACTAGACCTATACCAAGAATAACTCCTAGGACTACAATTAGACCAACAACGACAAGCACTACATCAACGACGTCAACAACCTCACGTAGTACCAACACAAACGAAGTTGCTCCTCTAACACCAACGCCAGCCTGGTATCCGAGTCCATCACGTGGTCATCTCAGCCCTGTTAATTGCGAGCTAGGGTATACCTACAACCACAATGAAAGAAAATGTGTTGATATAGATGAATGTGCTACCCAAAGAGCTAGCTGTGGACCTACAGAGGACTGCGTAAATACAGAAGGAGGATATCGCTGCGAATGTGGCCCTAGATGTCTATCTCGCAGACAAAACACCTCTTATACTTACCACGACAACCCGCCAGTCATCAGTCCAGATTCCAATGTGATCACAATAGGCGCTCAGTACGGCCAGCGAGGGCCGAGGTACATGCGCCCGACATACAAGCGACTCCACGACACGGGATCTGTGCTTACTACATGTCCATGGGGATACAAACTTACACCAGATAGAGTTTGTATGGATTTGGATGAATGTGAGATGAATATCTCCGAGTGTGGCCCGCAGCAGCGTTGTGAAAACTTTTATGGAGGCTACTCGTGCCAATGTCCAGCCGGCCATTGGAGCAACGGCAAGCAATGTGATGACATCGATGAGTGCAGTTATGGCAATACATGCTCCTACAACGCGCGATGCATCAACACTGTCGGGTCCTACCGTTGTGAGTGTTCAGAGGGCTTCAGGAACGCTCCATCTAACGACAAAGTCTGCGTGGATGTAGACGAGTGCTCCGAGCCTGAACCTTTATGTGAACAAGTGTGCGTGAACGCTTGGGGGGGATACAGGTGCTATTGCAATAGGGGCTATAGACTCAGCAATGACAATCGGACTTGTACGGATGTAGATGAATGCGCAGAGTCAGGTTCCCGTATATGCACAGCTCAGTGCGTTAACACCGTGGGCTCCTATCGTTGCGCTTGCCCTTCAGGTTACCGACTGGCTGACGATAAACGATCTTGTCTAGATATTGACGAATGTGAAAATGGCCAGGCTCGCTGCGGTGGAGTGGGAGAGGTTTGTCAGAACACCCGCGGTGGCTACCACTGCCATCAGATAAAATGCCCGCCAGGGTACCGCCTCGAAGGAAAACACAAATGCGCTCGGATACAACGCTCGTGTCCAGTCTCGGACTGGTCGTGTCTTCAGCAACCGAGTACCTACAGCTACAATTTTATAACATTCGTCTCCAACTTGTATTTGCCTCTAGGAAGTGTGGATCTATTCTCTATGCAAGGTCCTGCATGGCGTGATGCTGTAGTGAACTTTGAGATGCGTCTCTTAGACGTGCAAGCGGCGCCTGGAGTCAAACCGGCAGATATCACGTGCTTTGGCATGAGGCCTAGTAGCAACGTCTGTGTGATCTCTCTCCAATGTTCCCTTCAAGGTCCACAAGTAGCTGAATTGGAACTAACCATGTCTCTATACCAAAGATCTATGTTCGCTGGCAACGCTGTCGCCAGACTAGTCGTGATCGTATCAGAATACGAGTACTAA

Protein sequence:

>DPOGS215444-PA
MKLLVMNIVLCVVRFSVQGALTSEEIVDITETCCSYGEMFLMTSPDKDCSKLGTPEDIEPEQMEACKPAAKTCCEQQILKIDECNAGIKWAVAKKCQTPEDEIGKTCCDECSFGRLAGTQGKQACGDEPSEFLSPLTALRKMAYHKCCVEAAQELETTTEKKKVTTTEKPKEKCKANSCEHNCSDSDGKVTCLCKDGYRLQQDKKSCKDINECAEAVDDLCTDKDTVCHNTEGSFKCVPLKKRDVGLSCPPGFKRNVVNQVCDDINECRLPRPPCPKYLCENTIGGYKCAGKVGKPYTEDGTGPTTEAGASTSSTVRNDICPPGFRAGPDDECLDIDECEEHLDDCQRLSQYCINTHGSYFCQDHVSKRCAPGFKVNSNTGICEDIDECEESSEVCKRNEVCINLPGAYNCKSKISTLPKLATQNCQEGTRRRGSSCEDIDECREGTHLCDQFQNCINTFAGHECRCKNGFELDSTSGSCVDIDECALKLDNCGSELRCLNVLGSFTCTRSTSTPPAPVYEYEYYDSEEDNSVIPSPETTSSTTTSTTTSTTTTTTTPRPTTTSSTTTSTRPSTTPKPYQPRRYPNTPRRPFYHRSSTSTTTSTTSTTPPPVPKYPEWSDYPRENTTPKEVTVPKPDITNVIETDKEPDGSFVLNTNDIPKDRWTNVINREHERFNPNWLHCLDGYERNERGECVDINECGANRHSCSSLEYCINTPGSYDCECIPGFVRDPSGWCGVATTPSTSPSPPTQRPTTLRLTTSKPTTTSRPTTTPRPTRPPRIPAARPTRPIPRITPRTTIRPTTTSTTSTTSTTSRSTNTNEVAPLTPTPAWYPSPSRGHLSPVNCELGYTYNHNERKCVDIDECATQRASCGPTEDCVNTEGGYRCECGPRCLSRRQNTSYTYHDNPPVISPDSNVITIGAQYGQRGPRYMRPTYKRLHDTGSVLTTCPWGYKLTPDRVCMDLDECEMNISECGPQQRCENFYGGYSCQCPAGHWSNGKQCDDIDECSYGNTCSYNARCINTVGSYRCECSEGFRNAPSNDKVCVDVDECSEPEPLCEQVCVNAWGGYRCYCNRGYRLSNDNRTCTDVDECAESGSRICTAQCVNTVGSYRCACPSGYRLADDKRSCLDIDECENGQARCGGVGEVCQNTRGGYHCHQIKCPPGYRLEGKHKCARIQRSCPVSDWSCLQQPSTYSYNFITFVSNLYLPLGSVDLFSMQGPAWRDAVVNFEMRLLDVQAAPGVKPADITCFGMRPSSNVCVISLQCSLQGPQVAELELTMSLYQRSMFAGNAVARLVVIVSEYEY-