Previous PageTable Of Contents

5. Conclusion

Based on our literature review, experiments, and the results presented in the previous chapter, we conclude that querying a large database of weather observations for past weather cases similar to a present case using a fuzzy k-nearest neighbors (fuzzy k-nn) algorithm that is designed and tuned with the help of a weather forecasting expert can increase the accuracy of predictions of cloud ceiling and visibility at an airport.

We have proposed, implemented, and tested a fuzzy k-nn based prediction system called WIND-1. Its unique component is an expertly-tuned fuzzy k-nn algorithm with a temporal dimension. We tested it with the problem of producing 6-hour predictions of cloud ceiling and visibility at an airport given a database of over 300,000 consecutive hourly airport weather observations (36 years of record). Its prediction accuracy was measured with standard meteorological statistics and compared to a benchmark prediction technique, persistence. In realistic simulations, WIND-1 was significantly more accurate. WIND-1 produced forecasts at the rate of about one per minute.

The fuzzy k-nn based prediction method is significantly more accurate than the non-fuzzy based prediction method. The only variation between the two methods is the nature of the membership functions used to compare attributes of cases. The fuzzy k-nn method uses fuzzy membership functions that span certain ranges around the case being forecast for, whereas the non-fuzzy method uses 0-1-0 functions centered across the same ranges. This suggests that, compared to the accuracy of simple persistence, the significantly higher accuracy of fuzzy k-nn based forecasts is attributable to the use of fuzzy sets to measure similarity as opposed to using crisp sets. To the best of our knowledge, all previous methods used to measure similarity between weather cases have used only crisp sets.

Of significance to case based reasoning: We have shown how fuzzy logic can impart to case-based reasoning the perceptiveness and case-discriminating ability of a domain expert. The fuzzy k-nn technique described in this thesis retrieves similar cases by emulating a domain expert who understands and interprets similar cases. The main contribution of fuzzy logic to case-based reasoning is that it enables us to use common words to directly acquire domain knowledge about feature salience. This knowledge enables us to retrieve a few most similar cases from a large temporal database, which in turn helps us to avoid the problems of case adaptation and case authoring.

The fuzzy k-nn algorithm, even though it is of approximate Order(n) complexity, makes superior predictions with practical speed-with less than one minute of computation. This speed is achieved by strategically ordering the steps in a case-to-case similarity-measuring test and by stopping any test as soon as a step reveals that a case is dissimilar enough to be ruled out of the k-nn set without the need for further tests. For example, suppose we have a database of n past temporal cases. And suppose each case is described by m attributes and is p time units long, thus each case is described by m·p attributes. To measure the similarity of every past case, we would need to perform n·m·p individual tests. However, we are only interested in finding the k most similar cases, and most cases can be ruled out of contention with a single test. So, the number of tests we need to perform is much closer to the order of n than it is to the order of n·m·p.

Of significance to meteorology and the aviation industry: Such a fuzzy k-nn weather prediction system can improve the technique of persistence climatology (PC) by achieving direct, efficient, expert-like comparison of past and present weather cases. PC is a sort of analog forecasting technique that is widely recognized as a formidable benchmark for short-range weather prediction. Previous PC systems have had two built-in constraints: they represented cases in terms of the memberships of their attributes in predefined categories and they referred to a preselected combination of attributes (i.e., defined and selected before receiving the precise and numerous details of present cases). The proposed fuzzy k-nn system compares past and present cases directly and precisely in terms of their numerous salient attributes. The fuzzy k-nn method is not tied to specific categories nor is it constrained to using only a specific limited set of predictors. Such a system for making airport weather predictions will let us tap many, large, unused archives of airport weather observations, ready repositories of temporal cases. This will help to make airport weather predictions more accurate, which will make air travel safer and make airlines more profitable.

We plan to pursue this research and improve the WIND-1 system in the following ways.

References

Aamodt, A., and Plaza, E. (1994) Case-based reasoning: Foundational issues, methodological variations, and system approaches, AI Communications, IOS Press, 7, 39-59.

Abarbanel, H. D. I., Brown, R., Sidorowich, J. J., and Tsimring, L. S. (1993) The analysis of observed chaotic data in physical systems, Reviews of Modern Physics, Vol. 65, No. 4, 1331-1367.

Aha, D. W. (1997) The Omnipresence of case-based reasoning in science and application, Technical Report AIC-98-002). Washington, DC: Naval Research Laboratory, Navy Center for Applied Research in Artificial Intelligence.

Aha, D. W. (1998) Feature weighting for lazy learning algorithms. In Liu, H. and Motoda, H. (Eds.), Feature Extraction, Construction, and Selection: A Data Mining Perspective. Norwell MA: Kluwer, http://www.aic.nrl.navy.mil/papers/1998/AIC-98-003.ps, downloaded September 15, 1999.

Baldwin, J. F., and Martin, T. P. (1995) Fuzzy modelling in an intelligent data browser, Proceedings of the 1995 IEEE International Conference on Fuzzy Systems and the 2nd International Fuzzy Engineering Symposium, IEEE, vol. 5, 1885-1890.

Bardossy, A., Duckstein, L., and Bogardi, I. (1995) Fuzzy rule-based classification of atmospheric circulation patterns, International Journal of Climatology, 15, 1087-1097.

Barry, R. G., and Chorley, R. J., (1968) Atmosphere, Weather and Climate, 4th edition, Methuen & Co. Ltd., New York, New York, USA. 239-240.

Battan, L. J. (1984) Fundamentals of Meteorology, Prentice-Hall, Eaglewood Cliffs, New Jersey, USA, 157-161.

Bezdek, J. C. (1994) Fuzziness vs. probability - Again (!?), IEEE Transactions on Fuzzy Systems, Vol. 2, No. 1, (an editorial article) February 1994, 1-3.

Bezdek, J. C., and Pal, S. K. (eds.) (1992) Fuzzy Models for Pattern Recognition, IEEE Press.

Bonissone, P. P., and Ayub S. (1992) Similarity measures for case-based reasoning systems. In Bouchon_Meunier et al. (eds.) IPMU'92-Advanced Methods in AI: 4th International Conference on Information Processing and Management of Uncertainty in KBS, Palma de Mallorca, July 1992 Proceedings, Spain. Lecture Notes in AI 682, 483-487.

Bonissone, P. P., and Cheetham, W. (1997) Financial applications of fuzzy case-based reasoning to residential property valuation, Proceedings of the 6th IEEE International Conference on Fuzzy Systems. 1997. Vol. 1, 37-44.

Bosc, P. and Pivert, O. (1992) Some approaches for relational databases flexible querying, Journal of Intelligent Information Systems, 1, 323-354.

Bradley, F. M., and Ellis, B. W. (eds.) (1992) Rodale's All-New Encyclopedia of Organic Gardening: The Indispensable Resource For Every Gardener, Rodale Press Inc., Emmaus, Pennsylvania, USA, pgs. 629-630 and 651.

Bridge, D. G. (1998) Defining and combining symmetric and asymmetric similarity measures, Proceedings of the 4th European Workshop On Case-Based Reasoning, 52-63.

Cain, T., Pazzani, M. J., and Silverstein, G. (1991) Using domain knowledge to influence similarity judgement. In Proceedings of the Case-Based Reasoning Workshop, Morgan Kaufmann, Washington, DC, 191-199.

Cats, G., and Wolters, L. (1996) The Hirlam project, IEEE Computational Science and Engineering, Winter 1996, Vol. 3, Issue 4, 4-7. ("Hirlam" is short for High resolution limited area modelling, a co-project of several European countries to develop and maintain a numerical short-range weather forecasting system. Additional information about Hirlam was available on its web site, The international HIRLAM Project, at http://www.knmi.nl/hirlam on September 13, 1999.)

Cheetham, W., and Graf, J. (1997) Case-based reasoning in colour matching, by In, Proc. ICCBR-97, Leake, D. & Plaza, E. (eds.) LNAI, Springer, 1-12.

Christopherson, D. (1998) Artificial intelligence in the weather forecast office-One forecaster's view, Preprints of the 1st Conference on Artificial Intelligence, American Meteorological Society, 136-143.

Clarke, D. A. (1995) Characterizing the causes of low ceiling and visibility at U.S. airports, Preprints of the 6th Conference on Aviation Weather Systems, American Meteorological Society, 325-330.

Conway, B.J. (1989) Expert systems and weather forecasting, Meteorological Magazine, 118, 23-30.

Cox, E. (1992) The seven truths of fuzzy logic, Computer Design, April 1992.

Creecy, R. H., Masand, B. M., Smith, S. J., and Waltz, D. L. (1992) Trading MIPS and memory for knowledge engineering, Communications of the ACM, 35, 48-64.

Cunningham, P., and Bonzano, A. (1999) Knowledge engineering issues in developing a case-based reasoning application, Knowledge-Based Systems, 12 (1999), 371-379.

Dallavalle, J. P., and Dagostaro, V. J. (1995) The accuracy of ceiling and visibility forecasts produced by the National Weather Service, Preprints of the 6th Conference on Aviation Weather Systems, American Meteorological Society, 213-218.

Doswell III, C. A., and Brooks, H. E. (1998) Budget cutting and the value of weather services, Weather and Forecasting, Vol. 13, No. 1, 206-212.

Doyle, C. (1995) Calculating the value of forecast availability at aerodromes in Canada: The potential cost of service withdrawal, Evaluation Network Newsletter, Meteorological Service of Canada, Environment Canada, Toronto, Canada. http://www.tor.ec.gc.ca/ppid/evalnet/eng/nl_i5e.htm#calcu, downloaded September 15, 1999.

Dreyfus, H. L. (1992) What Computers Still Can't Do: A Critique of Artificial Reasoning, The MIT Press, Cambridge, Massachusetts, pgs. 126, 128, 322-323.

Dubois, D., Esteva, F., Garcia, P., Godo, L., de Mantaras, L., and Prade, H. (1997) Fuzzy modelling of case-based reasoning and decision, Case-Based Reasoning Research and Development, Proceedings of the 2nd International Conference on Case-Based Reasoning (ICCBR-97); Leake, D. B., and Plaza, E. (eds.), Springer Verlag, Berlin. 599-610.

Dudani, S. A. (1976) The distance-weighted k-nearest neighbor rule, IEEE Transactions on Systems, Man, and Cybernetics, Volume SMC-6, Number 4, April 1976, 325-327.

Dyer, R. M. and Moninger, W. R. (1988) Summary report in the second workshop on artificial intelligence in the environmental sciences (AIRIES), 15-17 September 1987 Boulder Colorado, Bulletin of the American Meteorological Society, 69, 508-514.

Frankel, D. S., Draper, J. S., Peak, J. E., and McLeod, J. C. (1995) Artificial intelligence needs workshop, 4-5 November 1993, Boston, Massachusetts (meeting summary), Bulletin of the American Meteorological Society, Vol. 76, No. 5, 728-738.

Garner, T. (1995) Use of NGM regional profile forecasts for fog forecasting for space shuttle, landing conditions at KSC [Kennedy Space Center], Preprints of the 6th Conference on Aviation Weather Systems, American Meteorological Society, 189-191.

Gentner, D. and Forbus, K. (1991) MAC/FAC: A model of similarity-based retrieval, Proceedings of the Thirteenth Annual Conference of the Cognitive Science Society, Chicago, IL: Lawrence Earlbaum, 504-509.

Gershon, N. D., and Miller, C. G. (1993) Dealing with the data deluge (Special report: Environment, part 2), IEEE Spectrum, Vol. 30, No. 7, 28-32.

Gollvik, S., and Olsson, E. (1993) NORTAF: Computer generated aerodrome forecasts, 5th International Conference on Aviation Weather Systems, American Meteorological Society, 335-338.

Göös, J., Ala-Siuru, P., and Korpipää, P. (1999) Fuzzy monitoring and case-based diagnosis tool for electronics manufacturing process, Proceedings of the 4th UK Case-Based Reasoning Workshop (UKCBR4), University of Salford, UK. (A copy of this paper was kindly given to me by Janne Göös.)

Hadjimichael, M., Kuciauskas, A. P., Brody, L. R., Bankert, R. L., and Tag, P. M. (1996) MEDEX: A fuzzy system for forecasting Mediterranean gale force winds, Proceedings of FUZZ-IEEE 1996 IEEE International Conference on Fuzzy Systems, 529-534.

Hansen, B. K. (1997) SIGMAR: A fuzzy expert system for critiquing marine forecasts, AI Applications, Vol. 11, No. 1, 59-68.

Hansen, B. K., and Riordan, D. (1998) Fuzzy case-based prediction of ceiling and visibility, Preprints of the 1st Conference on Artificial Intelligence, American Meteorological Society, 118-123.

Hansen, B. K., Bankert, R., and Hadjimichael, M. (eds.) (1999) Fuzzy logic in environmental sciences: A bibliography, http://chebucto.ns.ca/~bjarne/fuzzy_environment, downloaded Sept. 27, 1999.

Hashemi, S. (1999) A review of case-based reasoning, A report for a doctoral directed studies course at the Faculty of Computer Science, DalTech, Dalhousie University, Halifax, Canada (submitted to Professor Denis Riordan).

Huschke, R. E. (editor) (1959) Glossary of Meteorology, American Meteorological Society, Boston, Massachusetts, USA, pgs. 106 and 419.

Jeng, B. C., and Liang, T.-P. (1995) Fuzzy indexing and retrieval in case-based systems, Expert Systems With Applications, Vol. 8., No. 1, 1995. Elsevier Science Ltd., 135-142.

Jones, E., and Roydhouse, A. (1995) Intelligent retrieval of archived meteorological data, IEEE Expert, Vol. 10, No. 6, 50-57.

Kamp, G., Lange, S., and Globig, C. (1998) Related Areas [to CBR. Final chapter]. In Lenz, M., Bartsch-Spörl, B., Burkhard, H.-D., and Wess, S. (eds.), Case-based reasoning technology) From foundations to applications, Springer-Verlag, Berlin Heidelberg, 325-351.

Kantrowitz, M., Horstkotte, E., Joslyn, C. (eds.) (1997) Answers to Questions about Fuzzy Logic and Fuzzy Expert System (Fuzzy Logic FAQ, i.e., Frequently Asked Questions), http://www.cs.cmu.edu/Groups/AI/html/faqs/ai/fuzzy/part1/faq-doc-9.html, downloaded September 11, 1999.

Kasriel, R. H. (1971) Undergraduate Topology, W. B. Saunders Company, Philadelphia, PA, U.S.A., pgs. 81-82.

Keller, J. M., Gray, M. R., and Givens Jr., J. A. (1985) A fuzzy k-nearest neighbor algorithm, IEEE Transactions on Systems, Man, and Cybernetics, Vol. 15, No. 4, 258-263.

Keller, J., Smith, C., and Wilson, F. W. (1995) Applications of column models for terminal weather nowcasts, Preprints of the 6th Conference on Aviation Weather Systems, American Meteorological Society, 66-71.

Kilpinen, J. (1993) The combination of forecasts in an automated aviation weather forecasting system, 5th International Conference on Aviation Weather Systems, American Meteorological Society, 339-340.

Kolodner, J. (1993) Case-Based Reasoning, Morgan Kaufmann, San Francisco, California, USA.

Kontkanen, P., Myllymäki, P., Silander, T., and Tirri, H. (1998) On Bayesian case matching, Advances in case-based reasoning: Proceedings of the Fourth European Workshop on CBR/EWCBR-98, 13-23.

Kosko, B. (1997) Fuzzy Engineering, Prentice Hall, New Jersey, USA.

Kuciauskas, A. P., Brody, L. R., Hadjimichael, M., Bankert, R. L., and Tag, P. M. (1998) MEDEX: Applying fuzzy logic to a meteorological expert system, Preprints of the 1st Conference on Artificial Intelligence, American Meteorological Society, 68-74.

Kumar, V. R., Chung, C. Y. C., and Lindley, C. A. (1994) Toward building and expert system for weather forecasting operations, Expert Systems with Applications, Vol. 7, No. 2, Permagon, Elsevier Science Ltd., 373-381.

Langly, P. (1996) Elements of Machine Learning, Morgen Kaufmann Publishers, San Francisco, California, USA., pp. 381-382.

Leake, D. B. (1996) CBR in context. The present and future; in Leake, D. B. (editor) (1996) Case-Based Reasoning: Experiences, Lessons & Future Directions, American Association for Artificial Intelligence, Menlo Park California, USA, 3-30.

Lefley, M., and Austin, V. (1997) Match'em: Using fuzzy logic to profile criminals. Proceedings of 6th. IEEE conference on fuzzy logic `97 July 1997, Barcelona, Spain.

Leigh, R. J. (1995) Economic benefits of Terminal Aerodrome Forecasts (TAFs) for Sydney Airport, Australia, Meteorological Applications, 2, 239-247.

Liao T. W., Zhang Z., and Mount, C. R. (1998) Similarity measures for retrieval in case-based reasoning systems, Applied Artificial Intelligence, 12, 267-288.

López de Mántaras, R., and Plaza, E. (1997) Case-based reasoning: An overview, AI Communications, 10, 21-29.

Lorenz, E. N. (1963) Deterministic nonperiodic flow, Journal of the Atmospheric Sciences, 20, 130-141.

Lorenz, E. N. (1969a) Three approaches to atmospheric predictability, Bulletin of the American Meteorological Society, 50, 345-349.

Lorenz, E. N. (1969b) Atmospheric predictability as revealed by naturally occurring analogues, Journal of the Atmospheric Sciences, 26, 636-646.

Lorenz, E. N. (1977) An experiment in nonlinear statistical weather forecasting, Monthly Weather Review, 105, 590-602.

Lorenz, E. N. (1993) The Essence of Chaos, University of Washington Press, Seattle, WA, USA.

Luger, G. F., and Stubblefield, W. A. (1998) Artificial Intelligence: Structures and Strategies for Complex Problem Solving, Addison Wesley Longman, Reading, Massachusetts, USA, pg. 238.

Macdonald, K. (1999) Personal correspondence. April 1 1999, Ken Macdonald is Acting Director of the Inter-Agency Services Branch, AES, Environment Canada.

Macdonald, K. (1998) Personal correspondence. July 10, 1998. Ken is Chief of Client Services and Market Management for the Interagency Services Branch of Environment Canada.

Main. J., Dillon, T. S., and Khosla. R. (1996) Use of fuzzy feature vectors and neural vectors for case retrieval in case based systems, NAFIPS 1996 Biennial Conference of the North American Fuzzy Information Processing Society, IEEE, New York, NY, 438-443.

Maner, W., and Joyce, S. (1997) WXSYS: Weather Lore + Fuzzy Logic = Weather Forecasts, presented at 1997 CLIPS Virtual Conference, http://web.cs.bgsu.edu/maner/wxsys/wxsys.htm, downloaded Feb. 27, 1999. (Maner offers the following citation information: "WXSYS," published on the CD-ROM (ISBN 0-534-95053-1) that accompanies Giarratano and Riley, Expert Systems: Principles and Programming, third edition (Boston: PWS Publishing Company, 1998). This program was one of only 6 programs selected for publication after an international competition conducted in 1997 as the CLIPS Virtual Conference. This artificial intelligence program, written in FuzzyCLIPS, may be the first fuzzy-logic expert system to predict local weather based on a compendium of common-sense rules extracted from weather lore. The program was 8673 lines as published, about half code and half documentation, and used 95 fuzzy prediction rules. The publisher required demonstration and validation code, which occupies an additional 3829 lines on the CD-ROM. With Sean Joyce. This paper was one of only 17 papers selected for publication after an international competition conducted in 1997 as the "CLIPS Virtual Conference". Joseph Giarratano and PWS Publishing Company hosted the conference. With Sean Joyce. Fifteen journal pages, PDF format.)

Martin, D. E. (1972) Climatic presentations for short-range forecasting based on event occurrence and reoccurrence profiles, Journal of Applied Meteorology, 11, 1212-1223.

McBratney, A. B., and Moore, A. W. (1985), Application of fuzzy sets to climatic classification, Agricultural and Forest Meteorology, 35, 165-185.

Meadows, N. (1997) TAF tools project-Phase one report, [specially commissioned for the] Interagency Services Report, National Weather Services Directorate, Meteorological Service of Canada, Environment Canada.

Meyer, W. D. (1995) A hybrid model for predicting fog and stratus clouds, Preprints of the 6th Conference on Aviation Weather Systems, American Meteorological Society, 441-444.

Mosher, F. R. (1998) Meteorological craftsman vs. meteorological engineers, Preprints of the 1st Conference on Artificial Intelligence, American Meteorological Society, 164-167.

Munataka, T., and Jani, Y. (1994) Fuzzy systems: An overview, Communications of the ACM, Vol. 37, No. 3, 69-76.

Murtha, J. (1995) Applications of fuzzy logic in operational meteorology, Scientific Services and Professional Development Newsletter, Canadian Forces Weather Service, 42-54.

Patton, R. (1996) Analysis of user requirement for commercial/transport aviation, Forecasting Research Technical Report No. 204, Meteorological Office, Bracknell, Berkshire, United Kingdom.

Pearson, W. R., and Lipman, D. J. (1988) Improved tools for biological sequence comparison, Proceedings of the National Academy of Sciences, Vol. 85, 2444-2448, April, 1988, Biochemistry.

Porter, C. W., and Seaman, N. L. (1995) Short-term high resolution forecasting of cloud ceiling and visibilities, Preprints of the 6th Conference on Aviation Weather Systems, American Meteorological Society, 331-336.

Rafiei, D. (1999) Fourier-Transform Based Techniques in Efficient Retrieval of Similar Time Sequences, Ph.D. thesis, Department of Computer Science, University of Toronto, Ontario, Canada.

Riesbeck, C. K. (1996) What next? The future of CBR in post-modern AI. In Case-Based Reasoning: Experiences, Lessons & Future Directions, Leake, D. B. (editor), American Association for Artificial Intelligence, Menlo Park California, USA, 371-388.

Riesbeck, C. K., and Schank, R. C. (1989) Inside Case-Based Reasoning, Lawrence Erlbaum Associates, Hillside, New Jersey, USA.

Rougegrez, S. (1993) Similarity evaluation between observed behaviors and the prediction of processes. In Wess, S., Althoff, K. D. and Richter, M. (eds.), Topics in Case-Based Reasoning, Proceedings First European Workshop on Case-Based Reasoning, 1993, Springer-Verlag, Berlin, 155-166.

Shakina, N. P., Ivanova, A. R., Sheveleva, O. V., and Kalugina, G. Yu. (1993) Aviation weather forecasting automated methods in the RAFC Moscow and the Airport Vnukovo, 5th International Conference on Aviation Weather Systems, American Meteorological Society, 341-342.

Shivakumar, N., and Garcia-Molina, H. (1995) SCAM: A copy detection mechanism for digital documents, Digital Libraries `95, The Second Annual Conference on the Theory and Practice of Digital Libraries, 155-163.

Silvert, W. (1981) The formulation and evaluation of predictions, International Journal of General Systems, Vol. 7, 189-205.

Singh, S. (1999) A single nearest-neighbor fuzzy approach for pattern recognition, International Journal of Pattern Recognition and Artificial Intelligence, Vol. 13, No. 1, 49-64.

Sivillo, J. K., Ahlquist, J. E., and Toth, Z. (1997) An Ensemble Forecasting Primer, Weather and Forecasting, Vol. 12, No. 4, 809-818.

Srinivasan, D., Liew, A. C., and Chang, C. S. (1994) Forecasting daily load curves using a hybrid fuzzy-neural approach, IEE Proceedings on Generation, Transmission and Distribution, Vol. 141, No. 6, 561-567.

Stanski, H. (1999) Personal correspondence. The commonest cause for TAFs needing to be amended is the occurrence of unforecast categories of cloud ceiling and visibility.

Stanski, H., Leganchuk, A., Hanssen, A., Wintjes, D., Abramowski, O., and Shaykewich, J. (1999) NAV CANADA's TAF amendment response time verification , Eighth Conference on Aviation, Range, and Aerospace Meteorology, 10-15 January 1999, Dallas, Texas, American Meteorological Society, 63-67.

Stern, H., and Parkyn, K. (1999) Predicting the likelihood of fog at Melbourne Airport, Eighth Conference on Aviation, Range, and Aerospace Meteorology, 10-15 January 1999, Dallas, Texas, American Meteorological Society, 174-178.

Sujitjorn, S., Sookjaras, P., Wainikorn, W. (1994) An expert system to forecast visibility in Don-Muang Air Force Base, 1994 IEEE International Conference on Systems, Man and Cybernetics (Humans, Information and Technology) (2-5 Oct., 1994), IEEE, NY, NY, USA, 2528-2531.

Tag, P. M., Hadjimichael, M., Brody, L. R., Kuciauskas, A. P., and Bankert, R. L. (1996) Automating the subjective recognition of 500 mb wind patterns as input to a meteorological forecasting system, Proceedings of the 15th Conference on Weather Analysis and Forecasting, American Meteorological Society, 347-350.

Tobin, K. W., Gleason, S. S., and Karnowski, T. P. (1998) Adaptation of the fuzzy k-nearest neighbor classifier for manufacturing automation, International Society for Optical Engineering Photonics, West Symposium on Electronic Imaging, San Jose, CA. January 22-30, 1998 (http://www-ismv.ic.ornl.gov/publications/kwt_spie98.pdf, downloaded March 6, 2000)

US National Weather Service Aviation Weather Center, URL http://www.awc-kc.noaa.gov/awc/help/what_is_a_taf.html, downloaded Jan. 29, 1999.

Van den Dool, H. M. (1989) A new look at weather forecasting through analogues, Monthly Weather Review, 117, 2230-2247.

Van den Dool, H. M. (1994) Searching for analogues, how long must we wait?, Tellus, 45A, 314-324.

Viot, G. (1993) Fuzzy logic in C, Dr. Dobb's Journal, February 1993 (special issue on "cognitive computing"), pgs. 40-49 and 94.

Vislocky, R. L., and Fritsch, J. M. (1997) An automated, observations-based system for short-term prediction of ceiling and visibility, Weather and Forecasting, 12, 31-43.

Warner, T. T., and Stoelinga (1995) Very-very-short-range numerical simulation of ceiling and visibility during an East Coast snowstorm, Preprints of the 6th Conference on Aviation Weather Systems, American Meteorological Society, 141-143.

Watson, I., and Marir, F. (1994) Case-based reasoning: A review, Knowledge Engineering Review, Vol. 9, No. 4. (http://www.ai-cbr.org/classroom/cbr-review.htm, downloaded November 13, 1998)

Weber-Lee, R., Barcia, R. M., and Khator, S. K. (1995) Case-based reasoning for cash flow forecasting using fuzzy retrieval. Case-Based Reasoning Research and Development; Veloso, M., and Aamodt, A. (eds.), First International Conference on Case-Based Reasoning (ICCBR-95), Sesimbra, Portugal, Springer, 1995.

Whiffen, B. (1993) FTGEN - An automated FT production system, 5th International Conference on Aviation Weather Systems, American Meteorological Society, 327-330.

White, J. F. (1995) Aeronautical meteorology and the International Air Transport Association (IATA), World Meteorological Organization Bulletin, 44 (4), 335-339.

White, P.W. (1987) Impact of weather forecasts on aviation fuel consumption, The Meteorological Magazine, No. 1375, Vol. 115, pg. 29.

Wilson, L. J., and Sarrazin, R. (1989) A classical-REEP [Regression Estimation of Event Probabilities] short-range forecast procedure, Weather and Forecasting, 4, 502-516.

Winder, L., Lefley, M., and Smith, B. (1997) A key for freshwater invertebrates using fuzzy logic. Computer Applications in the Biological Sciences, February 1997.

Xia, B. B. (1997) Similarity Search in Time Series Data Sets, Master of Science thesis, Department of Computer Science, Simon Fraser University, BC, Canada.

Yager, R. R. (1997) Case-based reasoning, fuzzy systems modelling and solutions composition, Case-Based Reasoning Research and Development, Proceedings of the 2nd International Conference on Case-Based Reasoning (ICCBR-97); Leake, D. B., and Plaza, E. (eds.), Springer Verlag, Berlin. 633-642.

Yang, M. S. (1993) A survey of fuzzy clustering, Math. Comp. Modelling, vol. 18, no 11, 1-16.

Zadeh, L. A. (1965) Fuzzy sets, Information and Control, Vol. 8 (June 1965), 338-353; reprinted in Bezdek, J. C., and Pal, S. K. (eds.) (1992) Fuzzy Models for Pattern Recognition, IEEE Press, 35-45.

Zadeh, L. A. (1999) From computing with words to computing with perceptions-from manipulation of measurements to manipulation of perceptions, IEEE Transactions on Circuits and Systems-1: Fundamental Theory and Applications, Vol. 45, No. 1, 105-119.

Zimmerman, H. J. (1991) Fuzzy Set Theory and its Applications (2nd edition), Kluwer Academic Publishers.

Additional References on Analog Forecasting in Meteorology

Here is a list of meteorology papers that describe the method of analog forecasting. All of these papers contributed to our understanding but only a few of these papers are cited specifically in this thesis.

Barnett, T. P., and Preisendorfer, R. W. (1978) Multifield analog prediction of short-term climate fluctuations using a climate state vector, Journal of Atmospheric Science, 35, 1771-1787.

Barnston, A. G., and Livezey, R. E. (1989) An operational multifield analog / anti-analog prediction system for United States seasonal temperatures. Part II: Spring, summer, fall and intermediate 3-month period experiments, Journal of Climate, 2, 513-541.

Bergen, R.E., and Harnack, R. P. (1982) Long-range temperature prediction using a simple analog approach, Monthly Weather Review, 110, 1083-1099.

Guilbaud, S. (1997) Prevision Quantitative des Precipitations Jounalieres par une Method Statistico-Dynamique de Recherche d'Analogues [Prediction of Quantitative Precipitation by a Statistical-Dynamical Method for Finding Analogs], Ph.D Thesis, L'Institute National Polytechnique de Grenoble, France.

Huschke, R. E. (editor) (1959) Glossary of Meteorology, American Meteorological Society, Boston, Massachusetts, USA, pgs. 106 and 419.

Kruizinga, S., and Murphy, A. H. (1983) Use of an analogue procedure to formulate objectively probabilistic temperature forecasts in the Netherlands, Monthly Weather Review, 111, 2244-2254.

Livezey, R. E., and Barnston, A. G. (1988) An operational multifield analog/antianalog system for United States seasonal temperatures. Part 1: System design and winter experiments, Journal of Geophysical Research, Vol. 93, No. D9, 10953-10974.

Lorenz, E. N. (1963) Deterministic nonperiodic flow, Journal of the Atmospheric Sciences, 20, 130-141.

Lorenz, E. N. (1969a:Three approaches to atmospheric predictability, Bulletin of the American Meteorological Society, 50, 345-349.

Lorenz, E. N. (1969b) Atmospheric predictability as revealed by naturally occurring analogues, Journal of the Atmospheric Sciences, 26, 636-646.

Lorenz, E. N. (1977) An experiment in nonlinear statistical weather forecasting, Monthly Weather Review, 105, 590-602.

Lorenz, E.N. (1993) The Essence of Chaos, University of Washington Press, Seattle, WA, USA.

Martin, D. E. (1972) Climatic presentations for short-range forecasting based on event occurrence and reoccurrence profiles, Journal of Applied Meteorology, 11, 1212-1223.

Nicolis, C. (1998) Atmospheric Analogs and Recurrence Time Statistics: Toward a Dynamical Formulation. Journal of the Atmospheric Sciences, Vol. 55, No. 3, 465-475.

Radinivic, D. (1975) An analogue method for weather forecasting using the 500/1000 mb relative topography, Monthly Weather Review, 103, 639-649.

Roebber, P. J., Bosart, L. F. (1998) The Sensitivity of Precipitation to Circulation Details. Part I: An Analysis of Regional Analogs, Monthly Weather Review, Vol. 126, No. 2, 437-455.

Ruosteenoja, K. (1988) Factors affecting the occurrence and lifetime of 500 mb height analogues: A study based on a large amount of data, Monthly Weather Review, 116, 368-376.

Soucy, D. (1991) Revised users guide to days 3-4-5 automated forecast composition program, CMC Technical Document No. 37, Canadian Meteorological Centre, Environment Canada.

Toth, Z. (1989) Long-range weather forecasting using an analog approach, Journal of Climate, 2, 594-607.

Toth, Z. (1991a) Intercomparison of circulation similarity measures, Monthly Weather Review, 119, 55-64.

Toth, Z. (1991b) Estimation of the atmospheric predictability by circulation analogs, Monthly Weather Review, 119, 65-72.

Toth, Z. (1991c) Circulation patterns in phase space: A multinormal distribution?, Monthly Weather Review, 119, 1501-1511.

Van den Dool, H. M. (1994) Searching for analogues, how long must we wait?, Tellus, 45A, 314-324.

Van den Dool, H. M. (1989) A new look at weather forecasting through analogues, Monthly Weather Review, 117, 2230-2247.

Van den Dool, H. M. (1987) A bias skill in forecasts based on analogues and antilogues, Journal of Applied Meteorology, 26, 1278-1281.

Van den Dool, H. M. (1989) A new look at weather forecasting through analogues, Monthly Weather Review, 117, 2230-2247.

Woodcock, F. (1980) On the use of analogues to improve regression forecasts, Monthly Weather Review, 108, 292-297.

Appendix A: Sample Questionnaire for Knowledge Acquisition

Knowledge acquisition is performed simply by having the expert fill in a questionnaire such as the one shown below. When such a questionnaire is completed, it contains all the information needed to construct the fuzzy sets and to order the fuzzy operations that shown above in Section 3.2 (pg. 78). The grayed-out fields would normally be blank but we have inserted sample values for the WIND-1 configuration.

Part A: Attributes and order of comparison

Specify the attributes to compare and the order in which they are to be compared.

date of the year, hour of the day, cloud amount, cloud ceiling height, visibility, wind direction, wind speed, precipitation type, precipitation intensity, dew point temperature, dry bulb temperature, pressure trend

Part B: Continuous-number attributes

List the continuous-number attributes-those with values that can be positive or negative and which are compared in terms of their relative difference-and for each, specify values of difference that signify slightly near, near, and very near. If you choose to fill in only the middle column, then, by default, the threshold for slightly near will be twice that for near, and the threshold for very near will be half that for near.

Attribute

slightly near

near

very near

date of the year

60 days

30 days

10 days

hour of the day

2 hours

1 hours

0.5 hours

wind direction

40 degrees

20 degrees

10 degrees

dew point temperature

4 degrees

2 degrees

1 degree

dry bulb temperature

8 degrees

4 degrees

2 degree

pressure trend

0.20 kPa · hr -1

0.10 kPa · hr -1

0.05 kPa · hr -1

Part C: Absolute-number attributes

List the absolute-number attributes-those with values that can be only equal to zero or positive numbers, and which are compared in terms of their relative magnitudes-and for each possible pair, use numbers in the range (0.0...1.0] to specify how near they are to each other. The number 0.25 corresponds to slightly near, the number 0.50 corresponds to near, and the number 0.75 corresponds to very near.

Attribute

             

wind speed

0

1.00

         
 

1

0.75

1.00

       
 

2

0.50

0.75

1.00

     
 

3

0.25

0.50

0.75

1.00

   
 

4

0.10

0.25

0.50

0.75

1.00

 
 

...

...

...

...

...

...

...

   

0

1

2

3

4

...

               

cloud amount

...

           
               

cloud ceiling height

...

           
               

visibility

...

           

Part D: Nominal attributes

List the nominal attributes, and for each possible pair, use numbers in the range (0.0...1.0] to specify how near they are to each other The number 0.25 corresponds to slightly near, the number 0.50 corresponds to near, and the number 0.75 corresponds to very near. 68

Attribute

           

precipitation type

Nil

1.00

       
 

Drizzle

0.02

1.00

     
 

Showers

0.03

0.50

1.00

   
 

Rain

0.01

0.50

0.75

1.00

 
 

...

...

...

...

...

 
   

Nil

Drizzle

Showers

Rain

...

             

precipitation intensity

...

         

Part E: Recency

For each possible time step in comparable temporal cases, use numbers in the range (0.0...1.0] to specify the lowest level of similarity that can be attributed to attributes from that time step. The number 0.25 corresponds to slightly near, the number 0.50 corresponds to near, and the number 0.75 corresponds to very near.

 

Time

in case

Minimum

similarity

Time

in case

Minimum

similarity

 

...

...

t-0

0.00

 

t-12

0.96

t+1

0.00

 

t-11

0.95

t+2

0.00

 

t-10

0.94

t+3

0.00

 

t-9

0.93

t+4

0.00

 

t-8

0.92

t+5

0.00

 

t-7

0.91

t+6

0.00

 

t-6

0.90

t+7

0.00

 

t-5

0.80

t+8

0.00

 

t-4

0.70

t+9

0.00

 

t-3

0.60

t+10

0.00

 

t-2

0.40

t+11

0.00

 

t-1

0.20

t+12

0.00

     

...

...

Appendix B: A Worked-out Example of Fuzzy k-nn Algorithm for Prediction

We hope that the results achieved by the fuzzy k-nn algorithm are reproducible, in weather prediction and in other applications. Towards that end, this appendix presents a step-by-step, worked-out example of the fuzzy k-nn algorithm described in Section 3.2 (pg. 78).

We begin by assuming the similarity-measuring function has been configured as explained in Section 3.2 and in Appendix A. The three main processes in using the algorithm are as follows.

The first process, is the most original, so this appendix presents a detailed example of how similarity of cases is calculated. The following example pertains to weather but it technique ought to generalize to any sort of application which describes temporal cases in terms of continuous, absolute, and nominal attributes.

Measure similarity

A present case is composed of actual weather observations are shown in Figure 33.

CYHZ 120000Z 19010G16KT 1 1/2SM SHRA BR OVC007 21/20 A2990 RMK SF8 RERA SLP125=

CYHZ 120100Z 20011KT 1 1/2SM -SHRA BR BKN003 OVC007 21/20 A2991 RMK SF5SF3 SLP127=

CYHZ 120046Z 20012KT 3SM -SHRA BR OVC007 RMK SF8 RERA=

CYHZ 120200Z 23011KT 8SM BKN004 OVC011 21/20 A2992 RMK SF5SC3 SLP130=

CYHZ 120249Z 23008KT 2SM BR OVC007 RMK SF8 CIG RGD=

CYHZ 120300Z 23008KT 2SM BR OVC005 21/20 A2994 RMK SF8 SLP139=

CYHZ 120400Z 21009KT 10SM FEW007 OVC076 20/19 A2993 RMK SF2AC6 SLP135=

CYHZ 120500Z 32006KT 10SM BKN010 OVC075 19/18 A2995 RMK SC5AC3 SLP140=

CYHZ 120600Z 32005KT 12SM BKN008 OVC210 19/18 A2996 RMK SC6CI2 SLP143=

CYHZ 120700Z 33005KT 15SM FEW008 BKN250 18/17 A2997 RMK SF1CI1 SLP148=

CYHZ 120800Z 33006KT 15SM VCFG SKC 17/16 A2997 RMK VSBY NW 1/2 SLP149=

CYHZ 120900Z 31007KT 10SM PRFG FEW100 BKN250 16/16 A2998 RMK AC1CI1 VSBY LWR W SLP152=

CYHZ 121000Z 29005KT 12SM BKN250 16/15 A3001 RMK CI5 SLP161=

CYHZ 121100Z 29009KT 12SM BKN250 16/15 A3002 RMK CI5 SLP166=

CYHZ 121200Z 29006KT 15SM BKN250 16/14 A3003 RMK CI5 SLP170=

Figure 33. Actual weather observations (METAR code) for Halifax International Airport for the period 00:00 to 12:00 UTC 12 September 1999 (obtained from the Texas A&M Weather Interface website, http://www.met.tamu.edu/personnel/students/weather/weather_interface.html, downloaded September 12, 1999).

Three simplified weather cases are shown in Figure 34. Case 1 represents the present case to predict for; it is drawn from the data above in Figure 33. Cases 2 and 3 represent two analogs from the weather archive to make predictions from; they are hypothetical. For purposes of illustration, only seven-hour-long cases are considered and only three weather attributes are presented: cloud ceiling, wind direction, and precipitation type; these attributes are, respectively, absolute, continuous, and nominal (as described in Section 3.2, pg. 78). Longer cases with more attributes would be handled by straightforward extension of the technique shown.

 

case 1

case 2

case 3

time

cloud
ceiling
(dam)

wind
dirn.
(deg.)

pcpn.

cloud
ceiling
(dam)

wind
dirn.
(deg.)

pcpn.

cloud
ceiling
(dam)

wind
dirn.
(deg.)

pcpn.

t-3

9

200

shwrs

12

190

rain

9

170

drzl

t-2

12

230

nil

15

220

nil

9

210

nil

t-1

15

230

nil

21

220

nil

12

220

nil

t-0

21

210

nil

30

220

nil

15

210

nil

t+1

(30)

320

nil

24

330

nil

21

310

nil

t+2

(24)

320

nil

30

330

nil

24

310

nil

t+3

(999)

330

nil

999

340

nil

750

320

nil

Figure 34. Present case (1) and two analogs (2 and 3). Present case is weather from Halifax International Airport for the period 01:00 to 07:00 UTC 14 September 1999. Analogs are contrived for illustration purposes. The t-0 observation corresponds to a forecast start time of 04:00 UTC. In a forecast setting, the grayed-out observations in case 1 are not known, however auxiliary predictors (guidance) for the values of wind direction and precipitation are commonly available.

The three attributes presented Figure 34 are sufficient to demonstrate each of fuzzy similarity-measuring operations described in Section 3.2, namely m a, m c, m n, and m f(t) as shown in Figure 35 and Figure 36.

time

ceiling

wind dirn.

pcpn.

 

case 1

case 2

m a2

case 1

case 2

mc2

case 1

case 2

mn2

t-3

9

12

0.75

200

190

0.88

shwrs

rain

0.75

t-2

12

15

0.80

230

220

0.88

nil

nil

1.00

t-1

15

21

0.71

230

220

0.88

nil

nil

1.00

t-0

21

30

0.70

210

220

0.88

nil

nil

1.00

t+1

?

240

-

320

330

0.88

nil

nil

1.00

t+2

?

300

-

320

330

0.88

nil

nil

1.00

t+3

?

999

-

330

340

0.88

nil

nil

1.00

(a) Comparing case 2 to case 1, ma2 is the similarity between their absolute values of ceiling height, mc2 is the similarity between their continuous values of wind direction and mn2 is the similarity between their nominal types of precipitation.

time

ceiling

wind dirn.

pcpn.

 

case 1

case 3

ma3

case 1

case 3

mc3

case 1

case 3

mn3

t-3

9

9

1.00

200

170

0.38

shwrs

drzl

0.50

t-2

12

9

0.75

230

210

0.50

nil

nil

1.00

t-1

15

12

0.80

230

220

0.88

nil

nil

1.00

t-0

21

15

0.71

210

210

1.00

nil

nil

1.00

t+1

?

21

-

320

310

0.88

nil

nil

1.00

t+2

?

24

-

320

310

0.88

nil

nil

1.00

t+3

?

750

-

330

320

0.88

nil

nil

1.00

(b) Comparing case 3 to case 1, ma3 is the similarity between their absolute values of ceiling height, mc3 is the similarity between their continuous values of wind direction, and mn3 is the similarity between their nominal types of precipitation.

Figure 35. Similarity measurement between a present case (1) and two past cases (2 and 3). The fuzzy operations ma, mc, and mn are described in Section 3.2 (pg. 78).

The grayed-out values of wind direction and precipitation for the future parts of the present case (case 1) in Figure 35 are prevision obtained from auxiliary predictors, such as computer models or humans. As explained earlier, existing methods forecast large-scale phenomena, such as wind and precipitation, more effectively than they forecast small-scale phenomena, such as cloud ceilings at a particular airport.

 

ceiling

wind dirn.

pcpn.

t

ma2

mf(t)

maxa2

mc2

mf(t)

maxc2

mn2

mf(t)

maxn2

-3

0.75

0.60

0.75

0.88

0.60

0.88

0.75

0.6

0.75

-2

0.80

0.40

0.80

0.88

0.40

0.88

1.00

0.4

1.00

-1

0.71

0.20

0.71

0.88

0.20

0.88

1.00

0.2

1.00

-0

0.70

0.00

0.70

0.88

0.00

0.88

1.00

0.0

1.00

1

     

0.88

0.00

0.88

1.00

0.0

1.00

2

     

0.88

0.00

0.88

1.00

0.0

1.00

3

     

0.88

0.00

0.88

1.00

0.0

1.00

   

min =

0.70

 

min =

0.88

 

min =

0.75

                   
 

min { maxa2, maxc2, maxn2 } = min { 0.70, 0.88, 0.75 } =

0.70

(a) For case 2, the "min of the maxes" equals 0.70, due to a dissimilarity between cloud ceilings at time t-0. Assign to case 2 this value of similarity to case 1.

 

ceiling

wind dirn.

pcpn.

t

ma3

mf(t)

maxa3

mc3

mf(t)

maxc3

mn3

mf(t)

maxn3

-3

1.00

0.60

1.00

0.38

0.60

0.60

0.50

0.6

0.60

-2

0.75

0.40

0.75

0.50

0.40

0.50

1.00

0.4

1.00

-1

0.80

0.20

0.80

0.88

0.20

0.88

1.00

0.2

1.00

-0

0.71

0.00

0.71

1.00

0.00

1.00

1.00

0.0

1.00

1

     

0.88

0.00

0.88

1.00

0.0

1.00

2

     

0.88

0.00

0.88

1.00

0.0

1.00

3

     

0.88

0.00

0.88

1.00

0.0

1.00

   

min =

0.71

 

min =

0.50

 

min =

0.60

                   
 

min { maxa3, maxc3, maxn3 } = min { 0.71, 0.50, 0.60 } =

0.50

(b) For case 3, the "min of the maxes" equals 0.50, due to a dissimilarity between their wind directions at time t-2. Assign to case 3 this value of similarity to case 1.

Figure 36. Raise old low values of similarity-in effect, "forget" old dissimilarities with mf(t). Then determine the minimum of the maximum of all the similarities between past case and present case. The fuzzy operation mf is described in Section 3.2 (pg. 78).

The just described process of similarity measurement of temporal cases is the most complicated process in the fuzzy k-nn algorithm for prediction. The subsequent two process are relatively simple and explained briefly as follows.

Traverse case base

Traverse the case base measuring the similarity between past cases and a present case and simultaneously maintain a linked list of the k most similar cases (such as is shown in Figure 18 on pg. 88). Make every case-to-case similarity measuring process only as detailed as necessary. If initial attribute-to-attribute tests imply strong dissimilarity between cases-sufficient to exclude the past case in question form the k-nn set-then terminate the similarity measurement process for that past case and proceed to the next past case.

Make predictions based on a weighted median of the k-nn

For purposes of illustration, we assume that we sought only two analogs for the present case, that is, k = 2. The weighted median calculation easily extends to higher values of k.

Figure 36 shows that between case 2 and case 1 the degree of similarity equals 0.70, and between case 3 and case 1 the degree of similarity equals 0.50. Hence, a prediction for case 1 should consist of such proportional parts of case 2 and case 3, as shown in Figure 37.

time

case 2

case 3

 

prediction

(actual)

t+1

(0.7 * 24

+ 0.5 * 21)

/ (.7+.5) =

23

(30)

t+2

(0.7 * 30

+ 0.5 * 24)

/ (.7+.5) =

28

(24)

t+3

(0.7 * 999

+ 0.5 * 750)

/ (.7+.5) =

895

(999)

Figure 37. Prediction based on weighted median of k-nn (k = 2).

68 The nominal attribute table shows the "distance" between any two nominal attributes in the same way that a distance table on a highway map shows distances between towns. For illustration purposes, the table describes fuzzy relationships between only four common precipitation types. The actual WIND-1 configuration table specifies fuzzy relationships between 24 possible precipitation types (e.g, freezing rain, ice pellets, etc.).

Previous PageTable Of Contents