Mining Negative Associations from Medical Databases Considering Frequent, Regular, Closed and Maximal Patterns
Abstract
:1. Introduction
- Support (i001, ~i002, i003) = support (i001, i003) – sup (i001, i002, i003)
- —supp (A) ≥ ms, supp (B) ≥ ms, and supp (A∪B) < ms, where ms = Minimum Support
- —supp (A⇒¬B) = supp (A∪¬B); and —conf (A⇒¬B) = supp (A∪¬B)/supp (A) ≥ mc, where mc = minimum confidence.
- Important definitions related to this paper are placed in Appendix A.
1.1. Problem Definition
1.2. Objectives of the Research
- 1.
- To construct a database and generate an example set which can be used to experiment to find the negative associations among the regular, frequent, closed, and maximal items.
- 2.
- To develop an algorithm that finds the frequent, regular, closed, and maximal item sets and finds negative associations among those item sets.
- 3.
- Find the most optimum threshold values for frequency and regularity in which the most accurate negative associations can be found.
2. Related Work
3. Methods for Computing Negative Association among Frequent, Regular, Closed and Maximal Item Sets
3.1. Method 1
- If item sets X and Y are numerous but rarely occur together, then sup (X∪Y) < sup (X) * sup (Y), indicating a negatively correlated pattern.
- If sup (X∪Y) < sup (X) * sup (Y), X and Y are substantially negatively correlated, resulting in a strongly negatively correlated pattern. This definition can be extended to k-item sets. However, null transactions occur.
3.2. Method 2
- Sup (X¬E) * sup (¬A∪E) < sup (X∪E) * sup (¬A∪¬E), causing a null transaction to indicate the existence of a negative association.
3.3. Method 3
- Suppose that item sets A and B are frequent, i.e., sup (A) ≥ min-sup, sup (B) ≥ min-sup, where min-sup is the minimum support threshold.
- Then, P(A|B) + P(B|A)/2 < ∈, where ∈ is the negative pattern threshold. This way of computing the negative association is free from the problem of null transactions.
4. Computing the Negative Associations from Regular, Frequent, Closed and Maximal Item Sets
Algorithm 1 Mining negative associations from regular, frequent, closed and maximal item sets |
|
P.SL.No | Transaction ID | Patient Number | Disease | Drug | Chemicals | Drug | Chemicals | |||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
1 | T1 | P100 | DE1 | DR1 | CH1 | CH2 | CH3 | NA | NA | DR2 | CH4 | CH5 | CH9 | CH10 |
T2 | P100 | DE2 | DR3 | CH4 | CH5 | CH6 | NA | NA | DR4 | CH10 | CH15 | NA | NA | |
T3 | P100 | DE3 | DR5 | CH2 | CH3 | CH7 | NA | NA | DR6 | CH13 | CH14 | CH15 | NA | |
2 | T4 | P223 | DE4 | DR7 | CH5 | CH8 | CH10 | NA | NA | DR8 | CH11 | CH15 | NA | NA |
T5 | P223 | DE5 | DR9 | CH1 | CH3 | CH5 | CH16 | CH19 | NA | NA | NA | NA | NA | |
3 | T6 | P749 | DE6 | DR10 | CH4 | CH5 | CH16 | CH19 | NA | NA | NA | NA | NA | NA |
4 | T7 | P937 | DE7 | DR11 | CH2 | CH3 | CH7 | CH11 | NA | DR12 | CH12 | CH13 | NA | NA |
5 | T8 | P119 | DE8 | DR13 | CH5 | CH8 | CH11 | NA | NA | DR14 | CH12 | CH14 | CH15 | NA |
T9 | P119 | DE9 | DR15 | CH1 | CH3 | CH5 | NA | NA | DR16 | CH8 | CH9 | NA | NA | |
T10 | P119 | DE10 | DR17 | CH2 | CH3 | CH7 | CH8 | NA | DR18 | CH13 | CH14 | CH15 | NA | |
6 | T11 | P1235 | DE11 | DR19 | CH5 | CH8 | CH11 | CH15 | NA | DR20 | NA | NA | NA | NA |
7 | T12 | P11 | DE12 | DR21 | CH4 | CH5 | CH6 | NA | NA | DR22 | CH10 | CH15 | NA | NA |
T13 | P11 | DE13 | DR23 | CH2 | CH3 | CH7 | CH8 | NA | DR24 | CH13 | CH14 | CH15 | NA | |
T14 | P11 | DE14 | DR25 | CH5 | CH8 | CH11 | CH15 | NA | DR26 | NA | NA | NA | NA | |
8 | T15 | P4573 | DE15 | DR27 | CH1 | CH3 | CH5 | NA | NA | DR28 | CH9 | CH11 | NA | NA |
T16 | P4573 | DE16 | DR29 | CH4 | CH5 | CH6 | NA | NA | DR30 | CH14 | CH15 | NA | NA | |
9 | T17 | P8765 | DE17 | DR31 | CH2 | CH3 | CH6 | CH7 | NA | DR32 | CH12 | CH13 | NA | NA |
T18 | P8765 | DE18 | DR33 | CH5 | CH8 | CH11 | CH12 | NA | DR34 | CH14 | CH15 | NA | NA | |
10 | T19 | P10987 | DE19 | DR35 | CH1 | CH3 | CH5 | NA | NA | DR36 | CH6 | CH9 | CH10 | NA |
T20 | P10987 | DE20 | DR37 | CH4 | CH5 | CH6 | NA | NA | DR38 | CH12 | CH14 | CH15 | NA | |
T21 | P10987 | DE21 | DR39 | CH2 | CH3 | CH4 | NA | NA | DR40 | CH7 | CH13 | NA | NA | |
T22 | P10987 | DE22 | DR41 | CH5 | CH8 | CH11 | NA | NA | DR42 | CH12 | CH15 | NA | NA | |
T23 | P10987 | DE23 | DR43 | CH1 | CH3 | CH5 | NA | NA | DR44 | CH9 | CH14 | NA | NA |
Chemical Code | Transaction Ids | Maximum Regularity (4) | Minimum Frequency (3) | |||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
CH1 | T1 | T5 | T9 | T13 | T17 | T21 | 4 | 6 | ||||||||||
CH2 | T1 | T3 | T7 | T11 | T5 | T9 | 6 | 6 | ||||||||||
CH3 | T1 | T3 | T5 | T7 | T9 | T11 | T13 | T15 | T17 | T19 | T21 | 2 | 11 | |||||
CH4 | T1 | T2 | T6 | T10 | T14 | T18 | T19 | 4 | 7 | |||||||||
CH5 | T1 | T2 | T4 | T5 | T6 | T8 | T9 | T10 | T12 | T13 | T14 | T16 | T17 | T18 | T20 | T21 | 2 | 16 |
CH6 | T2 | T5 | T6 | T10 | T14 | T15 | T17 | T18 | 4 | 8 | ||||||||
CH7 | T3 | T7 | T11 | T15 | T19 | 4 | 5 | |||||||||||
CH8 | T4 | T8 | T9 | T11 | T12 | T16 | T20 | 4 | 7 | |||||||||
CH9 | T1 | T5 | T9 | T13 | T17 | T21 | 4 | 6 | ||||||||||
CH10 | T1 | T2 | T4 | T10 | T17 | 7 | 5 | |||||||||||
CH11 | T4 | T7 | T8 | T12 | T13 | T16 | T20 | 4 | 7 | |||||||||
CH12 | T7 | T8 | T15 | T16 | T18 | T20 | 7 | 6 | ||||||||||
CH13 | T3 | T7 | T11 | T15 | T19 | 4 | 5 | |||||||||||
CH14 | T1 | T3 | T8 | T11 | T14 | T16 | T18 | T21 | 5 | 8 | ||||||||
CH15 | T2 | T3 | T4 | T6 | T8 | T10 | T12 | T14 | T16 | T18 | T20 | 7 | 11 |
Chemical Code | Transaction Ids | Maximum Regularity (4) | Minimum Frequency (3) | |||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
CH1 | T1 | T5 | T9 | T13 | T17 | T21 | 4 | 6 | ||||||||||
CH3 | T1 | T3 | T5 | T7 | T9 | T11 | T13 | T15 | T17 | T19 | T21 | 2 | 11 | |||||
CH4 | T1 | T2 | T6 | T10 | T14 | T18 | T19 | 4 | 7 | |||||||||
CH5 | T1 | T2 | T4 | T5 | T6 | T8 | T9 | T10 | T12 | T13 | T14 | T16 | T17 | T18 | T20 | T21 | 2 | 16 |
CH6 | T2 | T5 | T6 | T10 | T14 | T15 | T17 | T18 | 4 | 8 | ||||||||
CH7 | T3 | T7 | T11 | T15 | T19 | 4 | 5 | |||||||||||
CH8 | T4 | T8 | T9 | T11 | T12 | T16 | T20 | 4 | 7 | |||||||||
CH9 | T1 | T5 | T9 | T13 | T17 | T21 | 4 | 6 | ||||||||||
CH11 | T4 | T7 | T8 | T12 | T13 | T16 | T20 | 4 | 7 | |||||||||
CH13 | T3 | T7 | T11 | T15 | T19 | 4 | 5 |
Chemical Code | Transaction Ids | Maximum Regularity (4) | Minimum Frequency (3) | |||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
CH3 | T1 | T3 | T5 | T7 | T9 | T11 | T13 | T15 | T17 | T19 | T21 | 2 | 11 | |||||
CH4 | T1 | T2 | T6 | T10 | T14 | T18 | T19 | 4 | 7 | |||||||||
CH5 | T1 | T2 | T4 | T5 | T6 | T8 | T9 | T10 | T12 | T13 | T14 | T16 | T17 | T18 | T20 | T21 | 2 | 16 |
CH6 | T2 | T5 | T6 | T10 | T14 | T15 | T17 | T18 | 4 | 8 | ||||||||
CH8 | T4 | T8 | T9 | T11 | T12 | T16 | T20 | 4 | 7 | |||||||||
CH11 | T4 | T7 | T8 | T12 | T13 | T16 | T20 | 4 | 7 |
5. Data Set for Experimentation
6. Results and Discussion
6.1. Results—Implementation of the Algorithm 1 on the Dataset
- Step 1:
- Extract sample data from the database.
- Step 2:
- Add transaction IDs to the extracted data from the database.
- Step 3:
- Convert Table 2 into a vertical format.
- Step 4:
- Prune the records that do not meet the threshold levels of regularity and frequency.
- Step 5:
- Prune the records which do not satisfy the maximality and the closedness criteria.
- Step 6:
- Find the negatively associated chemicals
- Step 7:
- Find negatively associated drugs
6.2. Discussion
7. Conclusions and Future Scope
Author Contributions
Funding
Institutional Review Board Statement
Data Availability Statement
Conflicts of Interest
Appendix A. Important Definitions and Representations
Key Element | Description of the Key Element |
Item Set | A set of Items that together appear in a Transaction |
K-Item Set | An item set having k number of items |
Closed Item Sets | An Item set is closed in a data set D if no super item set Y exists, such that Y has the same support count as X in D. |
Frequency of an item set | The number of transactions containing a specific item set is called absolute support. |
An item set is said to be frequent if the relative support or absolute support is less than >the minimum threshold value’. | |
Regular item sets | Regular item sets are those that occur many times within a specific time. The periodicity could be absolute or relative, which is measured as the distance between the transactions containing an item set. The regularity of the item set plays a major role when an unexpected disease occurs every time a specific drug is administered. |
Closed frequent item sets | An item set X is a closed frequent item set in D if X is both closed and frequent in D. |
Maximal frequent item set | An item set X is the maximal frequent item set in D when X is frequent, and no super item set Y exists such that X ⊂ Y and Y are frequent in D. |
Closed and Maximal Item set. | An item set is closed and maximal if the principality of maximality applies to closed item sets. |
Closed and maximal item sets substantially reduce the number of patterns generated in frequent item set mining while preserving complete information regarding the set of frequent item sets. That is, the frequent item sets and the related support values can be easily derived from the set of closed item sets. It is more desirable to mine closed frequent item sets rather than all set of all frequent item sets. | |
Interestingness measures | A set of measures (support, confidence, correlation, etc.) that reveal the interestingness of an item set for the user |
Association Rule | Is a rule applied to a set of item sets and triggers whether a rule reveals a positive association or negative association among the item sets |
Support of an association rule | The support of the association rule A –>B is the percentage of transactions that contain A∪B or P(A∪B) |
Confidence of an association rule | The confidence of the rule A –>B is the percentage of transactions in D that contain A and B. This is taken to be the conditional probability P(B | A) |
Strong association rules | The rules that satisfy minimum support and minimum confidence are called string association rules. |
Correlation | Item sets A and B are set to be correlated if the correlation coefficient between A and B is positive. Lift is a measure of the correlation between the item sets, and item sets A and B are independent when |
Lift | = P(B|A)/P(B) = conf (A⇒B)/sup (B) |
Item merging | It is a pruning method to reduce the number of items for finding the applicable rules. If every transaction containing a frequent item set X also contains an item set Y and X is not a superset of Y, then X∪Y forms a closed item set, and there is no need for searching for item set X but no Y. |
Sub item pruning | It is a pruning method to reduce the number of items for finding the applicable rules. If a frequent item X is a proper subset of an already found frequent closed item set Y and support count (X) = support count (Y), then X and all its descendants in the set enumeration tree cannot be frequent closed item sets and thus can be pruned. |
Item skipping | It is a pruning technique when a database is mined to find a hierarchical structur by employing depth-first; mining closed item sets at each level is undertaken. An item set X is associated with a header table and projected database. If a local frequent item p has the same support in several header tables at different levels, prune p from the header table at higher levels. |
References
- Aggarwal, C.C.; Yu, P.S. Mining associations with the collective strength approach. IEEE Trans. Knowl. Data Eng. 2001, 13, 863–873. [Google Scholar] [CrossRef]
- Aggarwal, C.C.; Yu, P.S. A new framework for item-set generation. In Proceedings of the Seventeenth ACM SIGACTSIGMOD-SIGART Symposium on Principles of Database Systems, PODS’98, Seattle, WA, USA, 1–4 June 1998; pp. 18–24. [Google Scholar]
- Agrawal, R.; Imielinski, T.; Swami, A. Mining association rules between sets of items in large databases. In ACM SIGMOD Record; ACM Press: New York, NY, USA, 1993; pp. 207–216. [Google Scholar]
- Agrawal, R.; Srikant, R. Fast algorithms for mining association rules. In Proceedings of the VLDB 1994 Proceedings of the 20th International Conference on Very Large Data Bases, San Francisco, CA, USA, 12–15 September 1994; pp. 487–499. [Google Scholar]
- Mahmood, S.; Shahbaz, M.; Guergachi, A. Negative and positive association rules mining from text using frequent and infrequent item sets. Sci. World J. 2014, 2014, 973750. [Google Scholar] [CrossRef] [PubMed]
- Chen, M.; Han, J.; Yu, P.S. Datamining an overview from database perspective. IEEE Trans. Knowl. Data Eng. 1996, 8, 866–883. [Google Scholar] [CrossRef]
- Ashok Savasere, A.; Omiecinski, E.; Navathe, S. Mining for strong negative associations in a large database of customer transactions. In Proceedings of the Fourteenth International Conference on Data Engineering, Orlando, FL, USA, 23–27 February 1998; pp. 494–502. [Google Scholar]
- Padmanabhan, B.; Tuzhilin, A. A belief-driven method for discovering unexpected patterns. In Proceedings of the Fourth International Conference on Knowledge Discovery and Data Mining (KDD-98), New York, NY, USA, 27–31 August 1998; pp. 94–100. [Google Scholar]
- Han, J.; Yin, Y. Mining Frequent Patterns without candidate generation. ACM SIGMOD Rec. 2000, 29, 1–12. [Google Scholar] [CrossRef]
- Zaki, M.J. Fast Vertical Mining Using Diffsets. In Proceedings of the KDD03: The Ninth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, Washington, DC, USA, 24–27 August 2003. [Google Scholar]
- Wu, X.; Zhang, C.; Zhang, S. Efficient Mining of Positive and Negative Association Rules. ACM Trans. Inf. Syst. 2004, 22, 381–405. [Google Scholar] [CrossRef]
- Daly, O.; Taniar, D. Exception Rules Mining Based On Negative Association Rules. Lect. Notes Comput. Sci. 2004, 3046, 543–552. [Google Scholar] [CrossRef]
- Thiruvady, D.R.; Webb, G.I. Mining Negative Association Rules Using GRD. In Proceedings of the Pacific-Asia Conference on Knowledge Discovery and Data Mining, Sydney, Australia, 26–28 May 2004; pp. 161–165. [Google Scholar] [CrossRef]
- Antonie, M.; Zaiane, O.R. Mining Positive and Negative Association Rules: An Approach for Confined Rules. European Conference on Principles of Data Mining and Knowledge Discovery. Mining Positive and Negative Association Rules An Approach for Confined Rules. In Proceedings of the 8th European Conference on Principles and Practice of Knowledge Discovery in Databases (PKDD04), Pisa, Italy, 20–24 September 2004; pp. 27–38. [Google Scholar] [CrossRef]
- Cornelis, C.; Yan, P.; Zhang, X.; Chen, G. Mining Positive and Negative Association Rules from Large Databases. In Proceedings of the 2006 IEEE Conference on Cybernetics and Intelligent Systems, Bangkok, Thailand, 7–9 June 2006. [Google Scholar] [CrossRef]
- Dong, X.; Sun, F.; Han, X.; Hou, R. Study of Positive and Negative Association Rules Based on multi-confidence and Chi-Squared Test. In Advanced Data Mining and Applications; Li, X., Zaïane, O.R., Li, Z., Eds.; Springer: Berlin/Heidelberg, Germany, 2006; Volume 4093, pp. 100–109. [Google Scholar] [CrossRef]
- Dong, X.; Niu, Z.; Shi, X.; Zhang, X.; Zhu, D. Mining Both Positive and Negative Association Rules from Frequent and Infrequent Itemsets. In Proceedings of the Third International Conference on Advanced Data Mining and Applications (ADMA 2007), Harbin, China, 6–8 August 2007; pp. 122–133. [Google Scholar] [CrossRef]
- Dong, X.; Zheng, Z.; Niu, Z.; Jia, Q. Mining Infrequent Item sets based on Multiple Level Minimum Supports. In Proceedings of the Second International Conference on Innovative Computing, Information and Control, Kumamoto, Japan, 5–7 September 2007. [Google Scholar]
- Dong, X.; Niu, Z.; Zhu, D.; Zheng, Z.; Jia, Q. Mining Interesting Infrequent and Frequent Itemsets Based on MLMS Model. In Proceedings of the International Conference on Advanced Data Mining and Applications, Chengdu, China, 8–10 October 2008; pp. 444–451. [Google Scholar]
- Khairuzzaman, T.S.; Ahmed, C.F.; Jeong, B.; Lee, Y. Mining regular patterns in transactional databases. IEICE Trans. Inf. Syst. 2008, 91, 2568–2577. [Google Scholar]
- Ouyang, W.; Huang, Q. Mining Positive and Negative Sequential Patterns with Multiple Minimum Supports in Large Transaction Databases. In Proceedings of the 2010 Second WRI Global Congress on Intelligent Systems, Wuhan, China, 16–17 December 2010. [Google Scholar] [CrossRef]
- Swesi, I.M.A.O.; Bakar, A.A.; Kadir, A.S.A. Mining Positive and Negative Association Rules from Interesting Frequent and Infrequent Itemsets. In Proceedings of the 2012 9th International Conference on Fuzzy Systems and Knowledge Discovery, Chongqing, China, 29–31 May 2012. [Google Scholar]
- Kumar, N.V.S.P.; Rao, K.R. Mining Positive and Negative Regular Item-Sets using Vertical Databases. Int. J. Simul. Syst. Sci. Technol. 2016, 17, 33.1–33.4. [Google Scholar] [CrossRef]
- Kumar, N.V.S.P.; Rao, L.J.J.; Kumar, G.V. A Study on Positive and Negative Association rule mining. Int. J. Eng. Res. Technol. (IJERT) 2012, 1–4. [Google Scholar]
- Ji, Y.; Ying, H.; Tran, J.; Dews, P.; Mansour, A.; Massanari, R.M. A Method for Mining Infrequent Causal Associations and Its Application in Finding Adverse Drug Reaction Signal Pairs. IEEE Trans. Knowl. Data Eng. 2013, 25, 721–733. [Google Scholar] [CrossRef]
- Bagui, S.; Dhar, P.C. Positive and negative association rule mining in Hadoop’s MapReduce environment. J. Big Data 2019, 6, 75. [Google Scholar] [CrossRef]
- Jiang, H.; Luan, X.; Dong, X. Mining weighted negative association rules from infrequent item sets based on multiple support. In Proceedings of the 2012 International Conference on Industrial Control and Electronics Engineering, Washington, DC, USA, 23–25 August 2012; pp. 89–92. [Google Scholar]
- Kishor, P.; Porika, S. An efficient approach for mining positive and negative association rules from large transactional databases. In Proceedings of the 2016 International Conference on Inventive Computation Technologies (ICICT), Coimbatore, India, 26–27 August 2016. [Google Scholar]
- Ramasubbareddy, B.; Govardhan, A.; Ramamohanreddy, A. Mining positive and negative association rules. In Proceedings of the 5th International Conference on Computer Science and Education, Hefei, China, 24–27 August 2018; pp. 1403–1406. [Google Scholar]
- Sahu, A.K.; Kumar, R.; Rahim, N. Mining negative association rules in a distributed environment. In Proceedings of the 2015 International Conference on Computational Intelligence and Communication Networks, Jabalpur, India, 12–14 December 2015; pp. 934–937. [Google Scholar]
- Brin, S.; Motwani, R.; Silverstein, C. Beyond market basket: Generalizing association rules to correlations. In Proceedings of the 1997 ACM SIGMOD International Conference on Management of Data ACM SIGMOD, Tucson, AZ, USA, 13–15 May 1997; pp. 265–276. [Google Scholar]
- Antonie, L.; Li, J.; Zaiane, O. Negative association rules. In Frequent Pattern Mining; Springer: Cham, Switzerland, 2014; pp. 135–145. [Google Scholar]
- Savasere, A.; Omiecinski, E.; Navathe, S. Mining for strong negative associations in a large database of customer transactions. In Proceedings of the ICDE, Orlando, FL, USA, 23–27 February 1998; pp. 494–502. [Google Scholar]
- Teng, W.-G.; Hsieh, M.-J.; Chen, M.-S. On the mining of substitution rules for statistically dependent items. In Proceedings of the ICDM, Maebashi City, Japan, 9–12 December 2002; pp. 442–449. [Google Scholar]
- Teng, W.G.; Hsieh, M.-J.; Chen, M.-S. A statistical framework for mining substitution rules. Knowl. Inf. Syst. 2005, 7, 158–178. [Google Scholar] [CrossRef]
- Antonie, M.-L.; Zaïane, O.R. Mining Positive and Negative Association Rules: An Approach for Confined Rules; Lecture Notes in Computer Science; Springer: Berlin/Heidelberg, Germany, 2004; pp. 27–38. [Google Scholar]
- Thiruvady, D.R.; Webb, G.I. Mining negative rules using GRD. In Advances in Knowledge Discovery and Data Mining; Lecture Notes in Computer Science; Springer: Berlin/Heidelberg, Germany, 2004; Volume 3056, pp. 161–165. [Google Scholar]
- Islam, M.S.; Hasan, M.M.; Wang, X.; Germack, H.D.; Noor-E-Alam, M. A Systematic Review on Healthcare Analytics: Application and Theoretical Perspective of Data Mining. Healthcare 2018, 6, 54. [Google Scholar] [CrossRef] [PubMed]
- Khaing, H.W. Data Mining based Fragmentation and Prediction of Medical Data. In Proceedings of the 2011 3rd International Conference on Computer Research and Development, Shanghai, China, 11–13 March 2011; pp. 480–485. [Google Scholar]
- Kaur, S.; Singla, J.; Nkenyereye, L.; Jha, S.; Prashar, D.; Joshi, G.P.; El-Sappagh, S.; Islam, M.S.; Islam, S.M. Medical Diagnostic Systems Using Artificial Intelligence (AI) Algorithms: Principles and Perspectives. IEEE Access 2020, 8, 228049–228069. [Google Scholar] [CrossRef]
- Wei, J.; Lu, Z.; Qiu, K.; Li, P.; Sun, A.H. Predicting Drug Risk Level from Adverse Drug Reactions Using SMOTE and Machine, Learning Approaches. IEEE Access 2020, 8, 185761–185775. [Google Scholar] [CrossRef]
- Lu, Y.; Chen, S.; Zhang, H. Detecting Potential Serious Adverse Drug Reactions using Sequential Pattern Mining Method. In Proceedings of the 2018 IEEE 9th International Conference on Software Engineering and Service Science (ICSESS), Beijing, China, 23–25 November 2018; pp. 56–59. [Google Scholar]
- Lu, Y.; Seidl, T. Towards Efficient, Closed Infrequent Item set Mining using Bi-directional Traversing. In Proceedings of the 2018 IEEE 5th International Conference on Data Science and Advanced Analytics (DSAA), Turin, Italy, 1–3 October 2018; pp. 140–149. [Google Scholar]
- Zhang, J.; Liu, W.; Wang, P. Drug-Drug Interaction Extraction from Chinese Biomedical Literature, using distant supervision. In Proceedings of the 2020 IEEE International Conference on Knowledge Graph (ICKG), Nanjing, China, 9–11 August 2020; pp. 593–598. [Google Scholar]
- Ramaraj, E.; Venkatesan, N. Positive and Negative Association Rule Analysis in Health Care Database. IJCSNS Int. J. Comput. Sci. Netw. Secur. 2008, 8, 325–330. [Google Scholar]
- Antonie, M.L.; Zaïane, O.R. Mining Positive and Negative Association Rules: An Approach for Confined Rules. In Proceedings of the 8th European Conference on Principles and Practice of Knowledge Discovery in Databases, Pisa, Italy, 20–24 September 2004; pp. 27–38. [Google Scholar]
- Antonic, M.L.; Zaiane, O.R. Mining Positive and Negative Association Rules—An approach for confine rules. In European Conference on Principles of Knowledge Discovery; Springer: Berlin/Heidelberg, Germany, 2004; pp. 27–38. [Google Scholar]
- Desai, J.R.; Hyde, C.L.; Kabadi, S.; Louis, M.S.; Bonato, V.; Loomis, A.K.; Galaznik, A.; Berger, M.L. Utilization of Positive and Negative Controls to Examine Comorbid Associations in Observational Database Studies. Med. Care 2017, 55, 244–251. [Google Scholar] [CrossRef]
- Goldacre, M.; Kurina, L.; Yeates, D.; Seagroatt, V.; Gill, L. Use large medical databases to study disease associations. QJM Int. J. Med. 2000, 93, 669–675. [Google Scholar] [CrossRef] [PubMed]
- Kim, Y.; Cho, Y. Predicting Drug–Gene–Disease Associations by Tensor Decomposition for Network-Based Computational drug repositioning. Biomedicines 2023, 11, 1998. [Google Scholar] [CrossRef] [PubMed]
Algorithm Serial Number | Main Author | Interestingness Measures | Occurrence Behavior | Type of Associations | Extension to Mining Technique | Use of Domain Knowledge | |||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Support | Confidence | Correlation | Multi Support | Multi Correlation | Regularity | Irregularity/ Rare | Frequent | Maximal | Unexpected | Positive Associations | Negative Associations | ||||
1 | Ashok Savasere [7] | ✓ | ✓ | ✓ | |||||||||||
2 | Balaji Padmanabhan [8] | ✓ | ✓ | ✓ | ✓ | ||||||||||
3 | Jiawe Han 2000 [9] | ✓ | ✓ | ✓ | FP Tree | ||||||||||
4 | J. Zaki [10] | ✓ | ✓ | ✓ | DI-SET | ||||||||||
5 | Xindong Wu [11] | ✓ | ✓ | ✓ | ✓ | ||||||||||
6 | Daly [12] | ✓ | ✓ | ✓ | Exception rule Mining | ||||||||||
7 | DR Thiruvady [13] | ✓ | ✓ | ✓ | |||||||||||
8 | Maria-Luiza, Antonie [14] | ✓ | ✓ | ✓ | ✓ | ||||||||||
9 | Xiangjun Dong [16] | ✓ | ✓ | ✓ | ✓ | ||||||||||
10 | Tanveer [20] | ✓ | |||||||||||||
11 | Weimin Ouyang [21] | ✓ | Sequential Mining | ||||||||||||
12 | Idheba Mohamad Ali [22] | ✓ | ✓ | ✓ | ✓ | ||||||||||
13 | Pavan NVS [23] | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | ✓ | Veridical Tab |
Chemical | Associated Drug | Chemical | Associated Drug | Chemical | Associated Drug | Chemical | Associated Drug |
---|---|---|---|---|---|---|---|
CH4 | DR3 | CH8 | DR7 | CH11 | DR11 | ||
CH6 | DR3 | CH8 | DR7 | CH11 | DR11 | ||
CH4 | DR3 | CH6 | DR3 | CH8 | DR7 | CH11 | DR11 |
Total Transactions | “%Max Regularity” | “%Support Count” | “Number of Negative Frequent Regular” | “Number of Negative Frequent Regular Maximal and Closed Items” |
---|---|---|---|---|
30,000 | 3 | 2 | 6 | 2 |
3 | 1.75 | 42 | 13 | |
3 | 1.625 | 154 | 46 | |
3 | 1.5 | 461 | 138 | |
30,000 | 2.5 | 1.75 | 41 | 12 |
2.5 | 1.625 | 154 | 46 | |
2.5 | 1.5 | 352 | 106 | |
2.5 | 1.125 | 981 | 294 | |
30,000 | 2 | 1.75 | 35 | 11 |
2 | 1.625 | 118 | 35 | |
2 | 1.5 | 181 | 54 | |
2 | 1.1.25 | 334 | 100 |
At Frequency 1.75 and Recs = 30,000 | |||
---|---|---|---|
Regularity | Number of Negative Frequent Regular Item Sets | Number Negative Frequent Regular Item, Closed and Maximal Item Sets | % Decrease |
3.00 | 6 | 2 | 66.7 |
2.50 | 41 | 12 | 70.7 |
2.00 | 35 | 11 | 68.6 |
Average % of decrease in negative associations | 68.6 |
At Frequency 1.5 and Recs = 30,000 | |||
---|---|---|---|
Regularity | Number Negative Frequent Regular Item Sets | Number Negative Frequent Regular Item, Closed and Maximal Item Sets | % Decrease |
3.00 | 461 | 138 | 70.1 |
2.50 | 352 | 106 | 69.9 |
2.00 | 181 | 54 | 70.2 |
Average | 70.0 |
Total Transactions | %Max Regularity | %Support Count | Number of Negative Frequent Regular | Number of Negative Frequent Regular Maximal and Closed Items | Reduction in Negative Associations | % Reduction |
---|---|---|---|---|---|---|
30,000 | 1.50 | 1.750 | 2 | 0 | 2 | 100 |
1.50 | 1.625 | 3 | 1 | 2 | 67 | |
1.50 | 1.500 | 3 | 1 | 2 | 67 | |
50,000 | 1.65 | 1.65 | 13 | 3 | 10 | 77 |
1.65 | 1.25 | 41 | 10 | 31 | 76 | |
1.65 | 1.00 | 150 | 50 | 100 | 67 | |
70,000 | 1.35 | 1.65 | 35 | 15 | 20 | 57 |
1.35 | 1.35 | 118 | 35 | 83 | 70 | |
1.35 | 1.00 | 181 | 54 | 127 | 70 | |
80, 000 | 1.00 | 0.875 | 35 | 15 | 20 | 57 |
1.00 | 0.815 | 118 | 35 | 83 | 70 | |
1.00 | 0.75 | 181 | 54 | 127 | 70 | |
Average Improvement | 71 |
Frequency | Number Negative Frequent Regular Item Sets | Number Negative Frequent Regular Item, Closed and Maximal Item Sets | % Reduction |
---|---|---|---|
1.750 | 35 | 11 | 70.0 |
1.625 | 118 | 35 | 70.0 |
1.500 | 181 | 54 | 70.0 |
Average | 70.0 |
Frequency | Number Negative Frequent Regular Item Sets | Number Negative Frequent Regular Item, Closed and Maximal Item Sets | % Reduction |
---|---|---|---|
1.650 | 13 | 3 | 0.77 |
1.250 | 41 | 10 | 0.76 |
1.000 | 150 | 50 | 0.67 |
Average | 0.73 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Budaraju, R.R.; Jammalamadaka, S.K.R. Mining Negative Associations from Medical Databases Considering Frequent, Regular, Closed and Maximal Patterns. Computers 2024, 13, 18. https://doi.org/10.3390/computers13010018
Budaraju RR, Jammalamadaka SKR. Mining Negative Associations from Medical Databases Considering Frequent, Regular, Closed and Maximal Patterns. Computers. 2024; 13(1):18. https://doi.org/10.3390/computers13010018
Chicago/Turabian StyleBudaraju, Raja Rao, and Sastry Kodanda Rama Jammalamadaka. 2024. "Mining Negative Associations from Medical Databases Considering Frequent, Regular, Closed and Maximal Patterns" Computers 13, no. 1: 18. https://doi.org/10.3390/computers13010018
APA StyleBudaraju, R. R., & Jammalamadaka, S. K. R. (2024). Mining Negative Associations from Medical Databases Considering Frequent, Regular, Closed and Maximal Patterns. Computers, 13(1), 18. https://doi.org/10.3390/computers13010018