as a PDF
Transcription
as a PDF
International Journal of Innovative Computing, Information and Control Volume 2, Number 4, August 2006 c ICIC International °2006 ISSN 1349-4198 pp. 885—896 REDESIGNING SUBGROUPS IN A PERSONNEL NETWORK BASED ON DNA COMPUTING Ikno Kim, Don Jyh-Fu Jeng and Junzo Watada Graduate School of Information, Production and Systems Waseda University 2-7 Hibikino, Wakamatsu-ku, Kitakyushu-shi, Fukuoka 808-0135, Japan [email protected]; [email protected]; [email protected] Received July 2005; revised November 2005 Abstract. Employees’ tasks have been changing according to the current times by technical development and reform in complicated human relationships, and their capabilities and abilities are also continuing to improve. Therefore, the necessity for a competent rating to qualitatively understand relations between employees arises. In this paper, we select a model of a workplace where employees are sharing information from a variety of workplaces, and we suppose a personnel network which contains their relations in terms of mutual understanding. Some subgroups are sure to exist in this employee personnel network, and the personnel network can be improved by redesigning all the cliques of subgroup networks. However, with a huge number of employees it becomes extremely hard to find the maximum clique in the personnel network, meaning this problem is NP-hard. All cohesive subgroups can be found by making the best use of DNA-based computing, also known as molecular computation, which is a new approach to massively parallel computation. DNA computing is drawing attention from many researchers around the world. The goal of this paper is to propose a way to apply DNA computing, one of the new biotechnologies, to human resource management that is a part of the management engineering field, and to measure the efficiency of DNA computing in redesigning subgroups to support work rotation. Keywords: Subgroup, Personnel network, Work rotation, Clique, DNA computing 1. Introduction. Routine tasks that are very specialized, such as assembly-line positions, hold limited appeal in advanced industrial societies. Rarely do these routine tasks offer opportunities for achievement, recognition, psychological growth, or other sources of satisfaction. To enhance the quality of work life for those who hold such responsibilities, human resource managers can use a variety of methods to improve tasks. The most widely practiced technique includes work rotation [1,2]. Work rotation moves employees from task to task. Tasks themselves are not actually changed a lot, only the employees are rotated. Rotation breaks the monotony of highly specialized work by calling on different skills and abilities. The organization benefits because employees become competent in several tasks rather than only one. Knowing a variety of tasks improves the employee’s self-image, provides personal growth, and makes the employee more valuable to the organization. The most important point is to note all the subgroups [3] in the personnel network to execute work rotation. The most important reason is that the personnel network might 885 886 I. KIM, D. J. JENG AND J. WATADA become a huge number of employees composed of all the connected subgroups, and those subgroups are also composed of subgroups, both small and large. Another important reason is that human resource managers hope to find the maximal number of employees who mutually exchange and share their information by close interpersonal relationships at work. However, even if employees’ close interpersonal relationships in the huge personnel network are known, it would be quite difficult to redesign all the subgroups efficiently, because the maximum cliques [4] from all the subgroups in the huge personnel network must be extremely hard to discern exactly. Furthermore, finding the maximum clique of employees becomes NP-hard, and cannot be solved using present electronic computers in polynomial time unless P=NP, which is widely believed to be false. Ordinarily, the majority of human resource managers redesign subgroups statistically based on either the frequency of employees’ relationships or managers’ experiences. To address this, we use DNA computing to solve these redesign problems of the subgroups in the personnel network. DNA computing has drawn attention in various fields [5,6] since it was proposed to be able to solve a Hamiltonian path with molecular computation by L. Adleman in 1994 [7]. Until now, there are no cases that apply to management problems. The attention is almost entirely from either the computer science fields or the biotechnology fields. However, we argue that DNA computing should be a very useful tool for a variety of management problems. Thus, we have shown that efficient solutions are obtained by DNA computing, and the efficiency of DNA computing is examined by redesigning subgroups in a personnel network to solve one of the management problems. 2. Analysis of Subgroups. A personnel network is composed of various types of subgroups. It is necessary to analyze all the subgroups to accurately understand the personnel network. Therefore, in this section, we describe and analyze subgroups in more detail to provide a better understanding of this personnel network problem. 2.1. Cohesive subgroups. Cohesion is closely related to concepts of strong ties among members of embedded social groups or closed social circles. Cohesive subgroups [3] are subsets of actors among whom there are relatively strong, dense, direct, frequent, intense, or positive ties. These are relations that enable employees to share or exchange their information, create solidarity, or act collectively. Numerous direct contacts among all cohesive subgroup employees, combined with few or null ties to outsiders, dispose a group toward a close interpersonal relationship in business, homogeneity of thought, behavior, and identity. Examples of formal cohesive groups include the personnel department, production department, quality control department, and finance department, whereas informal cohesive groups include religious cults and hobby clubs. Structural variables are measured on a single set of actors, such as relationships among employees give rise to one-mode networks, whereas there are types of structural variables that are measured on two sets of entities, such as actors of employees from different sets, one set consisting of corporations and a second set consisting of non-profit organizations. It could be measured the flows of financial support flows from corporations to non-profit employees. These two sets of employees are referred to as a two-mode network. A onemode network is the most common type of personnel network since all actors of employees come from only one set. Cohesive subgroups in one-mode networks focus on properties of REDESIGNING SUBGROUPS IN A PERSONNEL NETWORK 887 pair-wise ties, whereas cohesive subgroups in two-mode affiliation networks focus on ties existing among employees through their joint relationships in organizations. Figure 1. Presonnel network and nodal degrees for employees’ relations 2.2. Cliques. A cohesive subgroup should have a clique that is a useful starting point for specifying the formal properties. The clique also has well-specified mathematical properties, and captures much of the intuitive notion of cohesive subgroups. In a graph, the clique is a maximal complete subgraph of three or more nodes, and it consists of a subset of nodes, all of which are adjacent to each other, and there are no other nodes that are also adjacent to the employees of the clique. The most common concept for real personnel networks is the n-clique, which is much closer to people’ everyday understanding of the word ‘clique’. In this concept, n is the maximum path length at which employees of the clique will be regarded as connected. Thus, a 1-clique should be the maximal complete subgraph itself, and the subset in which all pairs of points are directly connected at distance only 1. On the other hand, a 2clique should be one in which the employees are either directly connected at distance 1 or indirectly connected at distance 2 through a common neighbor. 2.3. Maximum clique. A complete graph is a simple graph where an edge connects every pair of nodes in graph theories. A graph G = (N , E) is complete if all its nodes are pair-wise adjacent. A clique C in a graph G is a subset of nodes N such that the induced graph G(C) is complete. That is, there is an edge between every pair of distinct nodes in N . Clique problem takes a graph G and an integer k as input and asks whether there 888 I. KIM, D. J. JENG AND J. WATADA is a clique in G of size at least k. The cardinality of C is the number of nodes contained in C, denoted by |C|. Thus, the objective of the maximum clique is to find a clique of maximum cardinality in G. Furthermore, the maximum clique can be represented in the maximum number of employees who mutually exchange and share their information by close interpersonal business relationships, also the inclusiveness and the density of a complete relational graph should be 1 in the maximum clique. 2.4. Model graph of personnel networks. We select actors of employees in the same firm to create their own personnel network based on employees’ records and descriptions. This personnel network is a highly skilled group in a consulting firm that was first organized by the human resource manager, and was made for providing thoughtful leadership and specialized support to the firm’s knowledge management consultants; the group was composed of employees who had advanced degrees or extensive industry experience in technical fields. The network for this model is given in Figure 1. In the graph, there are N = 20 nodes, and E = 32 edges between the pairs of nodes. Even with as few as twenty actors of employees and thirty-two ties, the graph looks very complicated. There are also the four circles that represent present subgroups which do not really look like cohesive subgroups, and the connected lines that represent the relationships among the employees who mutually exchange and share their business information through a close interpersonal relationship in business. Although this personnel network is very intricately connected with four subgroups, it is important for the human resource manager to redesign better subgroups by finding a subgroup of all the cliques including the maximum clique, in order to satisfy work rotation requirements with a new personnel network. 3. DNA Computing Approach. DNA computing is recently drawing attention from a number of scientists, engineers and other researchers [8-10]. The attention is almost always focused on solving NP-completeness problems, combinational problems and difficult mathematical problems. However, we propose a way to solve one of the management problems using DNA computing. Therefore, in this section, we describe how to relate employees to DNA sequences, and how to approach redesigning subgroups based on a DNA experiment. Figure 2. Watson-crick complementarity REDESIGNING SUBGROUPS IN A PERSONNEL NETWORK 889 3.1. DNA computing. DNA computing is that using bio-molecules as a support for devising computers and computations, a new approach to massively parallel computation. In 1994, L. Adleman discovered a way to solve the Hamiltonian path problem, which is one of the NP-complete problems, using only biochemical techniques. He used DNA to solve a 7-node, a special case of an NP-complete problem that attempts to visit every node in a graph exactly once. Deoxyribonucleic acid consists of polymer chains, call DNA strands, that are composed of nucleotides adenine (A), guanine (G), cytosine (C) and thymine (T). Adenine always bonds with only thymine, while guanine always bonds with only cytosine. This phenomenon is called Watson-Crick complementarity. 3.2. Algorithm to find cliques. Basically, the ideas of analyzing and designing cohesive subgroups can define specific graph theoretic properties that should be satisfied in order to identify a subset of employees. For these kinds of approaches to cohesion in personnel networks, examining a set of personnel network data is to find collections of employees who have relatively strong ties, and it becomes visible by displaying functions or rearrangement of these networks. We note one of the approaches to cohesive subgroup analysis that is a socio-matrix [11], because this is the most important analyzing procedure to find all the cliques of the subgroups using DNA computing. A systematic way for ordering rows and columns of the socio-matrix reveals the subgroup structure of the personnel network. Figure 3 shows a socio-matrix of the model graph with the rows and columns, employees who have ties and close interpersonal business relationships to each other in the socio-matrix. The socio-matrix of size m × m becomes 20 rows and 20 columns for the model graph. There is a row and column for each node, and the rows and columns are labeled 1, 2, 3,. . . , 20. xij denotes the value of the tie from employee i to employee j, and xij records which pairs of nodes are adjacent. If nodes Ni and Nj are adjacent, then xij = 1, and if nodes Ni and Nj are not adjacent, then xij = 0. In addition, an edge between two nodes is either present or absent. If an edge is present, it goes both from Ni to Nj and from Nj to Ni , thus, xij = 1, and xji = 1. The total sum of rows and columns represents the level of the connections between edges, which is divided by 2, becomes L= m m P P i=1 j=1 2 xij f or i 6= j (1) where the number of edges corresponds to L. As shown in Figure 3, the total sum of rows and columns is 64, which means the number of edges should be 32. The number of edges should be equal to the number of all the independent lines in a 2-node. Furthermore, the connectivity matrix for 20 employees shown in Figure 1 is the 20 × 20 socio-matrix in Figure 3. This socio-matrix makes the best use of this DNA application because it is simple, and easy to understand how many DNA fragments and sequences we will need for the DNA experiment. We designed a new algorithm based on the socio-matrix in Figure 3 and the algorithm of the maximal clique problem solution that was proposed by Q. Ouyang [8], the method to find the maximum clique. However, the new algorithm that finds all the 1-cliques, as 890 I. KIM, D. J. JENG AND J. WATADA Figure 3. Socio-matrix for the personnel network of 20 employees well as the maximum clique, because we would like to design more cohesive and efficient subgroups in the personnel network based on the DNA results. Step 1: For a graph with N nodes, each possible clique is represented by a binary number, which 1 is in the clique, and 0 is vise versa. Also, if the total is 1, 1 is in the independent line. Step 2: Create all the possible combinations from 2N ; in this case, it becomes 220 =1048576 where N=20. Step 3: The graph containing all edges missing in the original graph is called the complementary graph. Any two nodes connected in the complementary are illegally connected in the original graph. Remove those cliques and independent lines that contain illegal connections, corresponding to xij = 0 in Figure 3. Step 4: Sort the remaining data pool to select the DNA-sequence from 2 bits of value 1 to 20 bits of the value 1 in existence. REDESIGNING SUBGROUPS IN A PERSONNEL NETWORK 891 Step 5: Find the nodes that connected between the possible cliques, and distinguish those connected cliques from other cliques. 3.3. Experiment. In this experiment, the DNA-sequence is designed in the form of double-stranded DNA, and it corresponds to 20 nodes while satisfying the given algorithm above. Each node of the DNA-sequence in a binary number is composed of two sequences that are a position sequence Ei and a value sequence Ni . The position sequences are used for connecting each node of the DNA-sequence, and the value sequence is used for distinguishing whether those position sequences contain that node or not. For a twenty-digit binary number in the graph, twenty value sections (N1 to N20 ) are prepared sandwiched sequentially between twenty-one position sections (E1 to E21 ). To amplify the DNA sequence using polymerase chain reaction (PCR), the last section E21 is needed. We set Ei with the length of 10 base pairs (bp), Ni with the length of 0 bp if the value of 1, and 6 bp if the value of 0. In this case, the longest DNA has 330 bp, and the shortest DNA has 210 bp. As Table 1 shows, we created 40 oligonucleotides for the experiment. Each oligonucleotide consists of two different position motifs, Ei Ni Ei+1 for odd i, and Ei+1 Ni Ei for even i, where the boxed represents the complementary sequence, and the value of Ni can be 1 or 0. The 40 fragment oligonucleotides were mixed together for thermal cycling. We need to remove the complementary sequences that contained illegal connections. Therefore, as Table 2 shows, we selected the 20 restriction enzyme sites through on technical references that are provided from New England BioLabs. Finally, we repeat selecting the shortest DNA strands, which correspond to all the possible cliques using a gel electrophoresis apparatus. 3.4. Experiment results. The experiment in repetition was able to find all the possible cliques including the maximum clique in the test tube. We decide the size of the clique by seeing the length of DNA strands. The clique of largest size is represented by the shortest length of DNA. As Table 3 shows, the maximum clique is {N2 , N7 , N9 , N18 , N20 }, which is connected at distance 300 bp by five nodes. The second largest clique is {N3 , N8 , N10 , N19 }, which is connected at distance 306 bp by four nodes. There are also six possible cliques that are all connected by three nodes, and the two independent lines with a 2-node were found as well. In addition, there are some nodes that represent empty black boxes which are vertically arranged in Table 3, so we can understand the possible cliques which are mutually connected together and find out all the nodes are divided into three components that also can be a component and two subgroups. 4. Redesign of Subgroups. Although there are many different kinds of cliques in a huge and complicated network, we hope to find all the cliques, and we hope to arrange them in the large order of cohesion. Moreover, even though we do not know exactly which cliques connect with which others or how connected subgroups are in the network, we hope to redesign subgroups efficiently. 4.1. Personnel network analysis. All these problems have been known from the results of the DNA experiment results. The personnel network is redesigned based on all the DNA experiment results, as shown in Figure 4. To prove the efficiency of the redesigned 892 I. KIM, D. J. JENG AND J. WATADA Table 1. Restriction enzyme sites are indicated by underlining and value sequences Nij , where j indicates the value of Ni . The value sequences are written with outlined letters. REDESIGNING SUBGROUPS IN A PERSONNEL NETWORK 893 c Table 2. 20 restriction enzyme sites °New England BioLabs Inc. Table 3. Experiment results of the DNA-sequences for 8 cliques and 2 lines personnel network using DNA computing, we show the differences between the redesigned personnel network and the previous personnel network. Therefore, we calculated the inclusiveness and the density of the both personnel networks [12]. Recall that a complete graph is one in which all the nodes are adjacent to one another. The concept of density is an attempt to summarize the overall distribution of edges in order to measure how far from this state of completion the graph is. First, inclusiveness defines the number of nodes that are included within the various connected parts of the graph. The inclusiveness of a graph is the total number of nodes minus the number of isolated nodes. The most useful measure of inclusiveness for comparing various types of graphs is the number of connected nodes expressed as a proportion of the total number of nodes. Thus, the proportion of the inclusiveness is 894 I. KIM, D. J. JENG AND J. WATADA Figure 4. Redesigned personnel network and nodal degrees for employees’ relations denoted by Π, and the inclusiveness is calculated as follows: n X Ni f or 0 ≤ Π ≤ 1 Π = C/ (2) i=1 where C is the number of connected nodes. Secondly, the density of a graph is defined as the number of edges in a graph. The density also expressed as a proportion of the maximum possible number of edges. The density is denoted by ∆, and the density is calculated as follows: L ∆= f or 0 ≤ ∆ ≤ 1 (3) n(n − 1)/2 where there are n(n − 1)/2 possible unordered pairs of nodes, and thus n(n − 1)/2 possible edges that could be presented in the graph. Table 4 shows the results of the inclusiveness, the density, and others for the both personnel networks. 4.2. Analysis results. As Figure 4 shows, it is known that the new subgroup 1 and subgroup 2 are connected by the relations between the two employees, N3 (Yoshiro) and N20 (Seiji). These two employees are appropriate to be information communicators, to exchange their information between subgroups 1 and 2. In addition, subgroups 3 and 4 show that, if the number of degrees is considered, N16 (Yasuo) could be a leader for subgroup 3, and N13 (Gohei) could be a leader for subgroup 4. Furthermore, it can be said that, if the four employees who are N20 (Seiji), N3 (Yoshiro), N16 (Yasuo), and N13 (Gohei) are connected, the network could be an integrated personnel network. REDESIGNING SUBGROUPS IN A PERSONNEL NETWORK 895 Table 4. Inclusiveness and density comparisons Finally, following the results in Figure 4 and Table 4 we can measure the efficiency of DNA computing in redesigning subgroups. Thus, it is better to use DNA computing rather than Figure 1 when human resource managers redesign subgroups in any kinds of personnel networks or other social networks to support work rotation in efforts to improve the quality of work life. 5. Concluding Remarks. In this paper, the subgroups were redesigned more efficiently and effectively as shown in the new personnel network. A massively parallel computation corresponds to DNA computing that was able to be done to make a new personnel network for an efficient work rotation. In addition, we show various ideas on human resource management based on the results of DNA computing. On the other hand, there were some problems, mainly that mistakes often happen in cutting DNA strands, and the number of nodes is limited to processes with picomole operations. Further research includes investigating DNA computing for nodes in signed or valued graphs to develop a proactive systematic approach to solving a real personnel network problem, and expanding the different kinds of applications for managerial problems from a variety of industrial and management fields in order to support optimal decision-making. REFERENCES [1] Umezu, H., MBA Management of Human Resources and Organization, Seisansei-Shuppan, pp.137157, 2003. [2] Werther, W. B., Jr. Werther and D. Keith, Human Resources and Personnel Management, McGrawHill, Inc., pp.145-157, 1995. [3] Wasserman, S. and K. Faust Social Network Analysis, Cambridge University Press, pp.149-290, 1999. [4] van Noort, D., F.-U. Gast and J. S. McCaskill, DNA computing in microreactors, submitted to Special Lectures in DNA Computing, The Graduate School of Information, Production and Systems, Waseda University, pp.1-10, 2005. [5] Jeng, D. J.-F., J. Watada and I. Kim, Solving a real time scheduling problem based on DNA computing, Proc. of the 2005 International Conference on Intelligent Technologies and Applied Statistics, Taipei, Taiwan, pp.317-322, 2005. [6] Liu, Q., L. Wang, A. G. Frutos, A. E. Condon, R. M. Corn and L. M. Smith, DNA computing on surfaces, Nature, vol.403, pp.175-179, 2000. [7] Adleman, L., Molecular computation of solutions to combinatorial problems, Science, vol.266, pp.1021-1024, 1994. [8] Ouyang, Q., P. D. Kaplan, S. Liu and A. Libacher, DNA solution of the maximal clique problem, Science, vol.278, pp.446-449, 1997. 896 I. KIM, D. J. JENG AND J. WATADA [9] Sakamoto, K., D. Kiga, K. Komiya, H. Gouzu, S. Yokoyama, S. Ikeda, H. Sugiyama and M. Hagiya, State transitions by molecules, Biosystems, vol.52, pp.81-91, 1999. [10] Ibrahim, Z., Y. Tsuboi, M. S. Muhammad, O. Ono and M. Khalid, Parallel overlap assembly for initial pool generation of direct-proportional length-based DNA computing, International Journal of Innovative Computing, Information & Control, vol.1, no.2, pp.261-275, 2005. [11] Carrington, P. J., J. Scott and S. Wasserman, Models and Methods in Social Network Analysis, Cambridge University Press, pp.77-97, 2005. [12] Yasuda, Y., Practice Network Analysis, Shinyosha, pp.55-63, 2004.