EP0335696A2 - Pattern recognition apparatus - Google Patents

Pattern recognition apparatus Download PDF

Info

Publication number
EP0335696A2
EP0335696A2 EP89303116A EP89303116A EP0335696A2 EP 0335696 A2 EP0335696 A2 EP 0335696A2 EP 89303116 A EP89303116 A EP 89303116A EP 89303116 A EP89303116 A EP 89303116A EP 0335696 A2 EP0335696 A2 EP 0335696A2
Authority
EP
European Patent Office
Prior art keywords
pattern
detailed
input
characteristic data
patterns
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Withdrawn
Application number
EP89303116A
Other languages
German (de)
French (fr)
Other versions
EP0335696A3 (en
Inventor
Yoshiaki C/O Patent Division Kurosawa
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Toshiba Corp
Original Assignee
Toshiba Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Toshiba Corp filed Critical Toshiba Corp
Publication of EP0335696A2 publication Critical patent/EP0335696A2/en
Publication of EP0335696A3 publication Critical patent/EP0335696A3/en
Withdrawn legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V30/00Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
    • G06V30/10Character recognition
    • G06V30/24Character recognition characterised by the processing or recognition method
    • G06V30/248Character recognition characterised by the processing or recognition method involving plural approaches, e.g. verification by template match; Resolving confusion among similar patterns, e.g. "O" versus "Q"

Definitions

  • the present invention is directed to computer-­assisted systems which automatically recognize an input pattern such as a picture image, voices and characters, and relates particularly to a pattern recognition apparatus which recognizes handwritten character pat­terns using a structural analysis method.
  • a structural analysis method executes pat­tern recognition by detecting a contour pattern of an input character, dividing the detected contour line into a plurality of partial patterns (normally called "segments") and searching the structure of the input character based on the shapes or characteristics of these partial patterns.
  • Such a method is effective in recognizing, with high accuracy, strongly distorted characters which are freely handwritten by people.
  • a pattern recognition apparatus employing a struc­tural analysis method uses, as reference data (reference segments), characteristic data of standard segment sha­pes that are prepared by executing a statistical analy­sis on standard contour patterns of characters registered for the individual categories.
  • the reference segments are significant in characterizing the shape of a character. These reference segments are registered in a main reference section.
  • To recognize the pattern of an input character first, the contour pattern of this input character is detected, followed by segmentation of the detected contour pattern, thereby providing partial contour segments of the input character pattern.
  • Each contour segment is sequentially compared, and collated, with segments corresponding to one reference character pattern by a matching unit.
  • pattern recognition for the input character is said to be successful.
  • the matching process if the input does not uniquely match with a single reference, the next reference character pattern will be selected and comparison and collation between the input and the selected pattern will be carried out; this process is repeated until pattern matching succeed.
  • registered reference characters are typically classified into several categories. For instance, English characters are generally classified into alphabets and numerals. With such a simple reference structure, however, if an input character pattern is intricate, it may be undesirably and erro­neously matched with a reference character that belongs to a different category, but yet has a high similarity in shape. This impairs the accuracy of pat­tern recognition. Given that an input character is "1" (alphanumeral), if it is discriminated to be matched with a reference character belonging to a different category, such as "7" (also an alphanumeral), due to their similarity in shape, the pattern recognition pro­cess for the input would fail. Failure in pattern recognition of a handwritten character pattern deteriorates the accuracy of the recognition result and impairs the reliability of this pattern recognition pro­cess.
  • each reference segment of each reference character is added with detailed sub-data such as the length, curvature and positional rela­tionship betwen adjoining segments.
  • a detailed pattern recognition process is executed using the detailed recognition program to thereby determine which one of the probable categories is correct.
  • the reference data structure is very intricate and rewriting or updating the contents of the reference is not an easy task for operators.
  • the detailed recognition program added to each segment of that reference character should also be updated every time.
  • Such an updating work is troublesome to operators and it is significantly difficult to self-manage the correspon­dence or corresponding relationship between old and new detailed recognition programs existing respectively before and after updation.
  • a specific pattern recognition apparatus which comprises: a segmentation unit for receiving an input pattern and segmenting the input pattern into partial patterns; a characteristic extraction unit for extracting characteristics of each of the partial patterns as input pattern characteristic data; reference unit for storing characteristics of partial patterns of each of standard patterns as reference characteristic data; and a processor unit, coupled to the characteristic extraction unit and the reference unit, for executing a matching process involving comparison of the input pattern characteristic data with the reference charac­teristic data to find out, from the reference unit, that standard pattern with which the input pattern is matched.
  • the reference unit has first and second reference sections.
  • the first reference section stores the stan­dard patterns and identification marks for identifying a specific one of the partial patterns of the standard patterns.
  • the second reference section stores infor­mation for investigating detailed characteristics of each of the standard patterns.
  • the processor unit uses the detailed characteristic infor­mation to finally identify one standard pattern which is correctly matched with the input pattern from among the matched standard patterns.
  • a pattern recognition appara­tus according to one preferred embodiment of this inven strictlytion is generally designated by reference numeral "10."
  • This apparatus automatically recognizes handwritten character input patterns, for example, but it is not restricted to such an input; the input may be a graphic image or a voice.
  • An input pattern is scanned by, and subjected to the necessary processes in, an input unit 12, and is then supplied to a contour segmentation unit 14.
  • This unit 14 detects the contour line of the input character pattern and segments the detected contour line into a plurality of partial patterns (called “contour segments” or simply “segments”).
  • the segment data is transferred to a characteristic extraction unit 16 where it is sub­jected to characteristic extraction.
  • the characteristic extraction unit 16 produces characteristic data of each segment of the input character pattern, which data is supplied to a matching processor 18.
  • a reference unit 20 is coupled to the matching processor 18, and includes a main reference 22 which is associated directly with the matching processor 18.
  • the main reference 22 stores segments (reference segments) of each of standard or reference character patterns. These reference character patterns are stored, classified for the individual character categories, in the main reference 22.
  • the matching processor 18 exe­cutes pattern recognition in such a manner that it com­pares and collates the input pattern with a certain reference pattern to identify that reference pattern which is matched with the input pattern with high simi­larity.
  • the matching processor 18 is coupled to a detailed recognition unit 24 which is associated with a detailed matching reference 26 included in the reference unit 20.
  • a detailed recognition unit 24 When matching between the input pattern and a reference reference pattern is partially or entirely successful in the matching process executed by the matching processor 18, then the detailed recog­nition unit 24 becomes operative.
  • the detailed recog­nition unit 24 performs a detailed pattern recognition process on the input pattern based on data or a pro­gram described in the detailed matching reference 26.
  • the final recognition result attained by the detailed recognition unit 24 is output from an output unit 28.
  • Fig. 2 illustrates a model status of reference pattern stored in the main reference 22.
  • reference character patterns hereinafter referred to as “category”
  • C1, C2, ..., Cn partial patterns (referred to as “reference segments”) are stored as characteristic data A, B, C, ..., Z in the main reference 22.
  • Those of the reference segments A, B, C, ..., Z which are used in a detailed pattern recognition process in the detailed recognition unit 24, are affixed with identification marks as shown in Fig. 2.
  • the identification mark No. 4 is affixed to the characteristic data of the reference segment B, and the identification mark No. 3 to the characteristic data of the reference segment Z.
  • the detailed matching reference 26 stores such a description as "Specify segments (X, Y) of the input pattern respectively corresponding to identification marks No. 3 and No. 4 and make a detailed discrimination using the correspondence between them" which is used in comparison and collation between the input pattern and the reference pattern C1.
  • the detailed recognition unit 24 specifies the input segments X and Y corresponding to the reference segments B and Z of the reference pattern having the identification marks No. 3 and No. 4 and executes the detailed recognition process using the correspondence between these segments.
  • the description in the detailed matching reference 26, concerning the use of the above marks, is automatically transferred to the detailed recognition unit 24 at the same time the detailed recognition process is initiated, thus ensuring smooth operation of the detailed recognition unit 24.
  • the contour segmentation unit 14 detects the general contour line of the input pattern and segments the detected con­tour pattern into partial contour lines a to h as shown in Fig. 3B.
  • These partial contour lines are hereinafter called “partial patterns” or “input segments” each of which is accompanied by its characteristic data such as position data, length data and vector data indicating the tangential direction.
  • Reference pattern characteristic data defining the sequence of reference segments as shown in Fig. 4A is registered in the main reference 22; the alphabets “e,” “s” and “c” included in the reference characteristic data represents data about the characteristics of the partial contours. For instance, “e” indicates that the partial contour is the end, “s” the middle linear por­tion, and “c” the middle curved portion.
  • the code data sequence of the reference segment characteristic data, "escsescs,” shown in Fig. 4A may result in that it indi­cates the character characteristic of both the character pattern "1" (Fig. 4B) and the character pattern "7" (Fig. 4C).
  • the matching processor 18 outputs such a pattern recognition result that the input pattern is matched with the reference patterns "1" and "7". This recognition result is supplied to the detailed recognition unit 24 located on the next stage.
  • the detailed recognition unit 24 executes a final recognition such that it performs a detailed recognition process on the recognition result from the matching processor 18 and discriminates whether the input pattern is the reference pattern "1" or "7". At this time, the detailed recognition process is executed in accor­dance with the description about the use of the marks stored in the detailed matching reference 26.
  • the description stored in the detailed matching reference 26 includes the following as detailed recognition data:
  • the detailed recognition unit 24 executes the detailed recognition process comprising the following steps with respect to the corresponding segments of the input character pattern:
  • the detailed recognition unit 24 can effectively search the reference segments which are used to compute the values of the detailed charac­teristic data X, Y and A necessary for executing the detailed recognition process. This can ensure effective extraction of the characteristic data of the partial contour lines of the input pattern, i.e., the input segments, thereby improving the efficiency of the detailed recognition process.
  • Fig. 6 illustrates a flowchart for the final pattern recognition process and computation of the values of the detailed recognition data X, Y, V and A.
  • a program that defines the rules for the process sequence is stored as firmware in the detailed matching reference 26.
  • the detailed recognition unit 24 is designed such that using the detailed recognition data stored in the detailed matching reference 26, the unit 24 additionally performs a secondary pattern recognition or a detailed recognition process with respect to the matching result attained by the primary pattern recogni­tion executed by the matching processor 18 using the main reference 22. Accordingly, the accuracy in recognizing an input pattern can be improved.
  • the matching reference unit 20 has the main reference 22 for storing reference segment characteristic data of standard patterns and the detailed matching reference 26 for storing detailed characteristic data and a detailed recognition program.
  • Marks #P are added to specific segments to identify them, which are included in the segments of each reference pattern stored in the main reference 22 and are necessary to compute the detailed recognition data in a detailed recognition pro­cess.
  • the main reference 22 for example, the reference pattern corresponding to a certain reference character, the correspondence be­tween the reference segments before updation and those after updation can be easily and clearly made using the marks #P.
  • the detailed recognition data and the content of the detailed recognition program, both stored in the detailed matching reference 26 can be still effec-tively used even after updation of a reference pattern.
  • Unnecessariness to alter the con­tents of the detailed matching reference 26 in updating the contents of the main reference 22 permits operators to easily and effectively deal with the contents of the matching reference unit 20. This can, therefore, faci­litate updation of the reference contents of a pattern recognition apparatus capable of performing pattern recognition with high accuracy and can improve the management and maintenance of the reference unit by operators.
  • the method for segmenting an input pattern to be recognized into partial patterns, the characteristic extraction method, the matching method and the specific method for affixing identification marks, all employed in the present pattern recognition apparatus may be basically changed to other existing methods.
  • the storage format of the reference patterns is not restricted to the above-described speci­fic format, but may be modified in various forms.
  • the above pattern recognition technique can be applied not only to characters but also other types of input patterns such as picture images and voices.

Landscapes

  • Engineering & Computer Science (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Multimedia (AREA)
  • Theoretical Computer Science (AREA)
  • Character Discrimination (AREA)
  • Image Analysis (AREA)

Abstract

A pattern recognition apparatus has a contour segmentation unit (14) for dividing an input pattern into segments, a characteristic extraction unit (16) for extracting characteristics of the input segments, and a reference unit (20) for storing characteristic data of reference patterns. The reference unit (20) includes a main reference (22) and a detailed matching reference (26). The main reference (22) stores partial pattern characteristic data representing the characteristics of segments of each reference pattern. The detailed matching reference (26) stores detailed characteristic data of each reference pattern together with a program for specifying an operation procedures thereof. A matching processor (18) sequentially compares and colla­tes the input pattern with the reference patterns to find out that standard pattern with which the input pat­tern is matched with the highest similarity. When the input pattern is matched with several reference pat­terns, a detailed recognition unit (24) performs a detailed recognition process using the detailed charac­teristic data of these reference patterns to finally select the correct one from among the reference pat­terns. The main reference (22) additionally stores identification marks to identify specific reference segments necessary to acquired the above detailed characteristic data.

Description

  • The present invention is directed to computer-­assisted systems which automatically recognize an input pattern such as a picture image, voices and characters, and relates particularly to a pattern recognition apparatus which recognizes handwritten character pat­terns using a structural analysis method.
  • Today, various methods have been developed as auto­matic pattern recognition methods for handprinted characters. A structural analysis method executes pat­tern recognition by detecting a contour pattern of an input character, dividing the detected contour line into a plurality of partial patterns (normally called "segments") and searching the structure of the input character based on the shapes or characteristics of these partial patterns. Such a method is effective in recognizing, with high accuracy, strongly distorted characters which are freely handwritten by people.
  • A pattern recognition apparatus employing a struc­tural analysis method uses, as reference data (reference segments), characteristic data of standard segment sha­pes that are prepared by executing a statistical analy­sis on standard contour patterns of characters registered for the individual categories. The reference segments are significant in characterizing the shape of a character. These reference segments are registered in a main reference section. To recognize the pattern of an input character, first, the contour pattern of this input character is detected, followed by segmentation of the detected contour pattern, thereby providing partial contour segments of the input character pattern. Each contour segment is sequentially compared, and collated, with segments corresponding to one reference character pattern by a matching unit. When all the segments of one input character are matched with all the reference segments of a reference character, pattern recognition for the input character is said to be successful. In the matching process, if the input does not uniquely match with a single reference, the next reference character pattern will be selected and comparison and collation between the input and the selected pattern will be carried out; this process is repeated until pattern matching succeed.
  • In the reference section of the pattern recogni­tion apparatus, registered reference characters are typically classified into several categories. For instance, English characters are generally classified into alphabets and numerals. With such a simple reference structure, however, if an input character pattern is intricate, it may be undesirably and erro­neously matched with a reference character that belongs to a different category, but yet has a high similarity in shape. This impairs the accuracy of pat­tern recognition. Given that an input character is "1" (alphanumeral), if it is discriminated to be matched with a reference character belonging to a different category, such as "7" (also an alphanumeral), due to their similarity in shape, the pattern recognition pro­cess for the input would fail. Failure in pattern recognition of a handwritten character pattern deteriorates the accuracy of the recognition result and impairs the reliability of this pattern recognition pro­cess.
  • As a solution to this problem, there is an attempt or measure to additionally describes a detailed recog­nition program for each reference character in the reference section in association with each segment. More specifically, each reference segment of each reference character is added with detailed sub-data such as the length, curvature and positional rela­tionship betwen adjoining segments. In comparing and collating an input character pattern with reference characters, when the input is discriminated to be matched with reference characters of several different categories through a main pattern matching process, a detailed pattern recognition process is executed using the detailed recognition program to thereby determine which one of the probable categories is correct.
  • With such an arrangement, however, the reference data structure is very intricate and rewriting or updating the contents of the reference is not an easy task for operators. For instance, in updating reference segment data of one reference character, the detailed recognition program added to each segment of that reference character should also be updated every time. Such an updating work is troublesome to operators and it is significantly difficult to self-manage the correspon­dence or corresponding relationship between old and new detailed recognition programs existing respectively before and after updation.
  • It is therefore an object of the present invention to provide a new and improved pattern recognition apparatus which can automatically recognize strongly distorted input patterns with high accuracy and can facilitate the accompanying work of updating the con­tents of a reference to thereby improve reference management/maintenance by operators. In accordance with the above object, the present invention is addressed to a specific pattern recognition apparatus which comprises:
    a segmentation unit for receiving an input pattern and segmenting the input pattern into partial patterns;
    a characteristic extraction unit for extracting characteristics of each of the partial patterns as input pattern characteristic data;
    reference unit for storing characteristics of partial patterns of each of standard patterns as reference characteristic data; and
    a processor unit, coupled to the characteristic extraction unit and the reference unit, for executing a matching process involving comparison of the input pattern characteristic data with the reference charac­teristic data to find out, from the reference unit, that standard pattern with which the input pattern is matched.
  • The reference unit has first and second reference sections. The first reference section stores the stan­dard patterns and identification marks for identifying a specific one of the partial patterns of the standard patterns. The second reference section stores infor­mation for investigating detailed characteristics of each of the standard patterns. When the input pattern is matched with a plurality of standard patterns, the processor unit uses the detailed characteristic infor­mation to finally identify one standard pattern which is correctly matched with the input pattern from among the matched standard patterns.
  • The invention and its objectives and advantages will become more apparent from the detailed description of a preferred embodiment of this invention presented below.
  • In the following detailed description, reference is made to the accompanying drawings in which:
    • Fig. 1 is a schematic block diagram illustrating the general arrangement of a pattern recognition appara­tus according to a preferred embodiment of this inven­tion;
    • Fig. 2 is a diagram illustrating model contents of a main reference 22;
    • Fig. 3A is a diagram illustrating the shape of an input character pattern;
    • Fig. 3B is a diagram illustrating segments of the contour line of the input character pattern shown in Fig. 3A;
    • Fig. 4A is a model diagram illustrating charac­teristic data of reference segments of one reference pattern and identification marks being stored in the main reference 22;
    • Figs. 4B and 4C are diagrams illustrating standard character patterns stored in the main reference 22 and divided into segments;
    • Fig. 5 is a diagram illustrating plural pieces of detailed characteristic data given for the standard pat­tern shown in Fig. 4C; and
    • Fig. 6 is a diagram illustrating a schematic flowchart of a detailed pattern recognition process executed in a detailed recognition unit shown in Fig. 1, using detailed characteristic data.
  • Referring to Fig. 1, a pattern recognition appara­tus according to one preferred embodiment of this inven­tion is generally designated by reference numeral "10." This apparatus automatically recognizes handwritten character input patterns, for example, but it is not restricted to such an input; the input may be a graphic image or a voice.
  • An input pattern is scanned by, and subjected to the necessary processes in, an input unit 12, and is then supplied to a contour segmentation unit 14. This unit 14 detects the contour line of the input character pattern and segments the detected contour line into a plurality of partial patterns (called "contour segments" or simply "segments"). The segment data is transferred to a characteristic extraction unit 16 where it is sub­jected to characteristic extraction. The characteristic extraction unit 16 produces characteristic data of each segment of the input character pattern, which data is supplied to a matching processor 18.
  • A reference unit 20 is coupled to the matching processor 18, and includes a main reference 22 which is associated directly with the matching processor 18. The main reference 22 stores segments (reference segments) of each of standard or reference character patterns. These reference character patterns are stored, classified for the individual character categories, in the main reference 22. The matching processor 18 exe­cutes pattern recognition in such a manner that it com­pares and collates the input pattern with a certain reference pattern to identify that reference pattern which is matched with the input pattern with high simi­larity.
  • The matching processor 18 is coupled to a detailed recognition unit 24 which is associated with a detailed matching reference 26 included in the reference unit 20. When matching between the input pattern and a reference reference pattern is partially or entirely successful in the matching process executed by the matching processor 18, then the detailed recog­nition unit 24 becomes operative. The detailed recog­nition unit 24 performs a detailed pattern recognition process on the input pattern based on data or a pro­gram described in the detailed matching reference 26. The final recognition result attained by the detailed recognition unit 24 is output from an output unit 28.
  • Fig. 2 illustrates a model status of reference pattern stored in the main reference 22. For each of reference character patterns (hereinafter referred to as "category") C1, C2, ..., Cn, partial patterns (referred to as "reference segments") are stored as characteristic data A, B, C, ..., Z in the main reference 22. Those of the reference segments A, B, C, ..., Z which are used in a detailed pattern recognition process in the detailed recognition unit 24, are affixed with identification marks as shown in Fig. 2. According to this embodiment, the identification mark No. 4 is affixed to the characteristic data of the reference segment B, and the identification mark No. 3 to the characteristic data of the reference segment Z.
  • The detailed matching reference 26 stores such a description as "Specify segments (X, Y) of the input pattern respectively corresponding to identification marks No. 3 and No. 4 and make a detailed discrimination using the correspondence between them" which is used in comparison and collation between the input pattern and the reference pattern C1. In executing a detailed recognition process on the matching result from the matching processor 18, in accordance with the above particular description stored in the detailed matching reference 26, the detailed recognition unit 24 specifies the input segments X and Y corresponding to the reference segments B and Z of the reference pattern having the identification marks No. 3 and No. 4 and executes the detailed recognition process using the correspondence between these segments. The description in the detailed matching reference 26, concerning the use of the above marks, is automatically transferred to the detailed recognition unit 24 at the same time the detailed recognition process is initiated, thus ensuring smooth operation of the detailed recognition unit 24.
  • The automatic input character recognition operation of the pattern recognition apparatus will now be described in more detail with reference to the input pattern exemplified in Fig. 3. When the numeral pattern "7" shown in Fig. 3A is input to the input unit 12, the contour segmentation unit 14 detects the general contour line of the input pattern and segments the detected con­tour pattern into partial contour lines a to h as shown in Fig. 3B. These partial contour lines are hereinafter called "partial patterns" or "input segments" each of which is accompanied by its characteristic data such as position data, length data and vector data indicating the tangential direction.
  • Reference pattern characteristic data defining the sequence of reference segments as shown in Fig. 4A is registered in the main reference 22; the alphabets "e," "s" and "c" included in the reference characteristic data represents data about the characteristics of the partial contours. For instance, "e" indicates that the partial contour is the end, "s" the middle linear por­tion, and "c" the middle curved portion. The code data sequence of the reference segment characteristic data, "escsescs," shown in Fig. 4A may result in that it indi­cates the character characteristic of both the character pattern "1" (Fig. 4B) and the character pattern "7" (Fig. 4C). This means that the matching process in the matching processor 18 using the reference data stored in main reference 22 alone would cause the input pat­tern to be discriminated to be matched with both of dif­ferent character patterns "1" and "7." Therefore, the totally accurate pattern recognition cannot be expected from the matching process in the matching processor 18 alone. In other words, the matching process in the matching processor 18 using the reference data stored in the main reference 22 alone cannot accurately distinguish whether the input pattern is the reference character pattern "1" or "7." To realize the accurate recognition, it is necessary to execute an additional process in the detailed recognition unit 24 using the detailed matching reference 26.
  • When the input segment patterns shown in Fig. 3A are transferred to the characteristic extraction unit 16 from the contour segmentation unit 14, the unit 16 com­pares and collates the input pattern characteristic data with the reference pattern characteristic data shown in Fig. 4A for each corresponding segments. At this time, the correspondence between the input pattern segments and the reference pattern segments, as indicated below, is discriminated.
    Input Segment Reference Segment
    d 1
    e 2
    f 3
    g 4
    h 5
    a 6
    b 7
    c 8
  • In this case, if it is verified that the input pat­tern is matched with the pattern "7" and that it is also matched with the pattern "1", the matching processor 18 outputs such a pattern recognition result that the input pattern is matched with the reference patterns "1" and "7". This recognition result is supplied to the detailed recognition unit 24 located on the next stage.
  • The detailed recognition unit 24 executes a final recognition such that it performs a detailed recognition process on the recognition result from the matching processor 18 and discriminates whether the input pattern is the reference pattern "1" or "7". At this time, the detailed recognition process is executed in accor­dance with the description about the use of the marks stored in the detailed matching reference 26. Accord­ing to this embodiment, as shown in Fig. 5, the description stored in the detailed matching reference 26 includes the following as detailed recognition data:
    • (i) The horizontal distance X between the left­most point of the input segment corresponding to refer­ence segment 1 and the rightmost point of the input segment 7 (see steps 50, 52, and 54 in Fig. 6);
    • (ii) The vertical distance Y between the lower-most point of the segment 1 and the uppermost point of the input segment 3 (see steps 60, 62, and 64 in Fig. 6);
    • (iii) The height data of an input pattern or the vertical distance V between the uppermost point of the input segment corresponding to the reference segment 3 and the lowermost point of the input segment corre­sponding to the reference segment 5 (see step 56); and
    • (iv) The directional data about the middle linear segment 2 or the vector data representing the inclina­tion of this segment (see step 66).
  • Using these detailed recognition information, the detailed recognition unit 24 executes the detailed recognition process comprising the following steps with respect to the corresponding segments of the input character pattern:
    • (1) Comparing X/V with a predetermined threshold value Th1 (step 58);
    • (2) Comparing Y/X with a predetermined threshold value Th2 (step 66); and
    • (3) Comparing the direction A with a predetermined threshold value Th3 (step 68).
  • If this process results in
    X/V < Th1;
    Y/X > Th2; and
    Direction A > Th3,
    the input pattern is finally recognized to be the numeral "1" (step 70). If the above discrimination is negative for all the conditions or parameters, the input pattern is finally recognized to be the numeral "7" (step 72).
  • It should be noted that in the above detailed recognition process executed by the unit 24, the detailed characteristic data V can be attained substan­tially irrespective of the data about the partial con­tours of the input pattern; however, the other detailed characteristic data X, Y and A should be computed by actually using the contour data of the input segments corresponding to the reference segments 1, 2, 3 and 7. With regard to the standard pattern characteristic data shown in Fig. 4A, therefore, marks "#Pi" (i = 0, 1, 2, 3) are affixed to those portions "1," "2," "3" and "7" which respectively correspond to the reference segments 1, 2, 3, and 7 segments necessary to compute the detailed recognition data in the above detailed recogni­tion process. Referring to those additionally provided segment special codes, the detailed recognition unit 24 can effectively search the reference segments which are used to compute the values of the detailed charac­teristic data X, Y and A necessary for executing the detailed recognition process. This can ensure effective extraction of the characteristic data of the partial contour lines of the input pattern, i.e., the input segments, thereby improving the efficiency of the detailed recognition process. Fig. 6 illustrates a flowchart for the final pattern recognition process and computation of the values of the detailed recognition data X, Y, V and A. A program that defines the rules for the process sequence is stored as firmware in the detailed matching reference 26.
  • According to the present pattern recognition apparatus, the detailed recognition unit 24 is designed such that using the detailed recognition data stored in the detailed matching reference 26, the unit 24 additionally performs a secondary pattern recognition or a detailed recognition process with respect to the matching result attained by the primary pattern recogni­tion executed by the matching processor 18 using the main reference 22. Accordingly, the accuracy in recognizing an input pattern can be improved.
  • Furthermore, according to the present apparatus, the matching reference unit 20 has the main reference 22 for storing reference segment characteristic data of standard patterns and the detailed matching reference 26 for storing detailed characteristic data and a detailed recognition program. Marks #P are added to specific segments to identify them, which are included in the segments of each reference pattern stored in the main reference 22 and are necessary to compute the detailed recognition data in a detailed recognition pro­cess. In updating the contents of the main reference 22, for example, the reference pattern corresponding to a certain reference character, the correspondence be­tween the reference segments before updation and those after updation can be easily and clearly made using the marks #P. Therefore, the detailed recognition data and the content of the detailed recognition program, both stored in the detailed matching reference 26, can be still effec-tively used even after updation of a reference pattern. Unnecessariness to alter the con­tents of the detailed matching reference 26 in updating the contents of the main reference 22 permits operators to easily and effectively deal with the contents of the matching reference unit 20. This can, therefore, faci­litate updation of the reference contents of a pattern recognition apparatus capable of performing pattern recognition with high accuracy and can improve the management and maintenance of the reference unit by operators.
  • Although the invention has been described with reference to a specific embodiment, it shall be understood by those skilled in the art that numerous modifications may be made that are within the spirit and scope of the invention.
  • For instance, the method for segmenting an input pattern to be recognized into partial patterns, the characteristic extraction method, the matching method and the specific method for affixing identification marks, all employed in the present pattern recognition apparatus, may be basically changed to other existing methods. Further, the storage format of the reference patterns is not restricted to the above-described speci­fic format, but may be modified in various forms. Furthermore, the above pattern recognition technique can be applied not only to characters but also other types of input patterns such as picture images and voices.

Claims (7)

1. A pattern recognition apparatus (10) comprising: segmentation means (14) for receiving an input pattern and for segmenting said input pattern into partial pat­terns; characteristic extraction means (16) for extracting a characteristic of each of said partial pat­terns as input pattern characteristic data; reference storage means (20) for storing characteristics of par­tial patterns of each of standard patterns as reference pattern characteristic data; and identification means (18, 24), coupled to said characteristic extraction means (16) and said reference storage means (20), for executing a matching process involving comparison of said input pattern characteristic data with said reference pattern characteristic data to find out, from said reference storage means, that standard pattern with which said input pattern is matched, characterized in that said reference storage means (20) comprises:
a first reference section (22) for storing said standard patterns and plural pieces of additional data (e, s, c) for identifying a specific one of said partial patterns of said standard patterns; and
a second reference section (26) for storing detailed characteristics of each of said standard pat­terns as detailed recognition information, whereby when said input pattern is matched with a plurality of stan­dard patterns, said identification means (18, 24) uses said detailed recognition information to finally iden­tify one standard pattern which is correctly matched with said input pattern from among the matched standard patterns.
2. The apparatus according to claim 1, charac­terized in that said detailed recognition information includes first characteristic data (A) for specifying detailed characteristics of said specific partial pat­tern of each of said standard patterns.
3. The apparatus according to claim 2, charac­terized in that said detailed recognition information includes second characteristic data for specifying general characteristics of each of said standard pat­terns.
4. The apparatus according to claim 3, charac­terized in that said second reference section (26) stores said first and second characteristic data in association with said additional data (e, s, c).
5. The apparatus according to claim 4, charac­terized in that said second reference section stores a program for a detailed pattern recognition process using said first and second characteristic data, and said identification means (24) executes said detailed pattern recognition process in accordance with said program.
6. An apparatus (10) for automatically recognizing an input pattern, comprising: segmentation means (14) for segmenting an input pattern into input segments; charac­teristic extraction means (16) for extracting a charac­teristic of each of said input segments; reference storage means (20) for storing characteristic data of standard patterns; and matching process means (18), coupled to said characteristic extraction means (16) and said dictionary means (20), for executing a primary matching process in which said input pattern is sequen­tially compared and collated with said standard patterns to find out that standard pattern with which said input pattern is matched with a highest similarity, charac­terized in that said reference storage means (20) in­cludes main reference storage means (22) for storing partial pattern characteristic data representing charac­teristics of reference segments of each of said standard patterns, and detailed matching reference storage means (26) for storing detailed charac-teristic data of each of said standard patterns together with a program for specifying an operation sequence of said detailed characteristic data, in that detailed recognition means (24) is coupled to said matching process means (18) and said detailed matching reference storage means (24) for executing a secondary recognition process in such a manner that, when said input pattern is matched with several standard patterns, said detailed recogni­tion means (24) executes a detailed recognition process using said detailed characteristic data of said standard patterns to select a correct standard pattern from among said matched standard patterns, and in that said main reference storage means (22) additionally stores iden­tification marks (#P0, #P1, #P2, #P3) for identifying specific reference segments necessary to acquire said detailed characteristic data.
7. The apparatus according to claim 6, charac­terized in that said identification marks include code data.
EP19890303116 1988-03-29 1989-03-29 Pattern recognition apparatus Withdrawn EP0335696A3 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
JP73090/88 1988-03-29
JP63073090A JPH01246678A (en) 1988-03-29 1988-03-29 Pattern recognizing device

Publications (2)

Publication Number Publication Date
EP0335696A2 true EP0335696A2 (en) 1989-10-04
EP0335696A3 EP0335696A3 (en) 1991-08-21

Family

ID=13508293

Family Applications (1)

Application Number Title Priority Date Filing Date
EP19890303116 Withdrawn EP0335696A3 (en) 1988-03-29 1989-03-29 Pattern recognition apparatus

Country Status (3)

Country Link
US (1) US4972499A (en)
EP (1) EP0335696A3 (en)
JP (1) JPH01246678A (en)

Families Citing this family (46)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
DE69032318T2 (en) * 1989-08-31 1998-09-24 Canon Kk Image processing device
WO1991011783A1 (en) * 1990-01-23 1991-08-08 Massachusetts Institute Of Technology Recognition of patterns in images
US5313532A (en) * 1990-01-23 1994-05-17 Massachusetts Institute Of Technology Recognition of patterns in images
JPH03223976A (en) * 1990-01-29 1991-10-02 Ezel Inc Image collating device
JP2745764B2 (en) * 1990-03-08 1998-04-28 松下電器産業株式会社 Position recognition device
US5212739A (en) * 1990-10-17 1993-05-18 Hewlett-Packard Company Noise tolerant optical character recognition system
US5237627A (en) * 1991-06-27 1993-08-17 Hewlett-Packard Company Noise tolerant optical character recognition system
US6418424B1 (en) 1991-12-23 2002-07-09 Steven M. Hoffberg Ergonomic man-machine interface incorporating adaptive pattern recognition based control system
US8352400B2 (en) 1991-12-23 2013-01-08 Hoffberg Steven M Adaptive pattern recognition based controller apparatus and method and human-factored interface therefore
US6850252B1 (en) 1999-10-05 2005-02-01 Steven M. Hoffberg Intelligent electronic appliance system and method
US6400996B1 (en) 1999-02-01 2002-06-04 Steven M. Hoffberg Adaptive pattern recognition based control system and method
US7242988B1 (en) 1991-12-23 2007-07-10 Linda Irene Hoffberg Adaptive pattern recognition based controller apparatus and method and human-factored interface therefore
US10361802B1 (en) 1999-02-01 2019-07-23 Blanding Hovenweep, Llc Adaptive pattern recognition based control system and method
US5903454A (en) 1991-12-23 1999-05-11 Hoffberg; Linda Irene Human-factored interface corporating adaptive pattern recognition based controller apparatus
US5901255A (en) * 1992-02-07 1999-05-04 Canon Kabushiki Kaisha Pattern recognition method and apparatus capable of selecting another one of plural pattern recognition modes in response to a number of rejects of recognition-processed pattern segments
US5359671A (en) * 1992-03-31 1994-10-25 Eastman Kodak Company Character-recognition systems and methods with means to measure endpoint features in character bit-maps
IL106401A (en) * 1992-08-07 1997-02-18 Donnelley & Sons Co Method and system for converting bitmap data into page definition language commands
US5742702A (en) * 1992-10-01 1998-04-21 Sony Corporation Neural network for character recognition and verification
US5319722A (en) * 1992-10-01 1994-06-07 Sony Electronics, Inc. Neural network for character recognition of rotated characters
US6212299B1 (en) 1992-12-11 2001-04-03 Matsushita Electric Industrial Co., Ltd. Method and apparatus for recognizing a character
JP3448868B2 (en) * 1993-07-31 2003-09-22 ソニー株式会社 Image coincidence detecting device and image coincidence detecting method
US5828836A (en) * 1993-10-08 1998-10-27 International Business Machines Corporation Networked information communication system
EP0656602B1 (en) * 1993-12-02 2001-07-11 Nippon Telegraph And Telephone Corporation Image pattern identification/recognition method
IL111039A (en) * 1994-09-22 1998-08-16 Advanced Recognition Tech Handwritten pattern recognizer
JPH08194780A (en) * 1994-11-18 1996-07-30 Ricoh Co Ltd Feature extraction method
US5627915A (en) * 1995-01-31 1997-05-06 Princeton Video Image, Inc. Pattern recognition system employing unlike templates to detect objects having distinctive features in a video field
JPH08263667A (en) * 1995-03-28 1996-10-11 Nec Corp Method for recognizing rorschach picture
US5862259A (en) * 1996-03-27 1999-01-19 Caere Corporation Pattern recognition employing arbitrary segmentation and compound probabilistic evaluation
US5937084A (en) * 1996-05-22 1999-08-10 Ncr Corporation Knowledge-based document analysis system
JP2836681B2 (en) * 1997-01-14 1998-12-14 日本電気株式会社 Pattern matching encoding method and encoding device
JP3082837B2 (en) * 1997-03-19 2000-08-28 日本電気株式会社 Pattern matching encoding device, decoding device, and recording medium
DE19820353C2 (en) * 1998-05-07 2001-06-13 Ibm Method and device for recognizing a pattern on a template
US6327388B1 (en) 1998-08-14 2001-12-04 Matsushita Electric Industrial Co., Ltd. Identification of logos from document images
US7966078B2 (en) 1999-02-01 2011-06-21 Steven Hoffberg Network media appliance system and method
JP4224882B2 (en) * 1999-02-09 2009-02-18 ソニー株式会社 Data processing apparatus and data processing method
US6738512B1 (en) * 2000-06-19 2004-05-18 Microsoft Corporation Using shape suppression to identify areas of images that include particular shapes
US8682077B1 (en) 2000-11-28 2014-03-25 Hand Held Products, Inc. Method for omnidirectional processing of 2D images including recognizable characters
JP4019790B2 (en) * 2002-05-23 2007-12-12 富士ゼロックス株式会社 Image processing apparatus and image processing program
US20040042665A1 (en) * 2002-08-30 2004-03-04 Lockheed Martin Corporation Method and computer program product for automatically establishing a classifiction system architecture
US7149357B2 (en) * 2002-11-22 2006-12-12 Lee Shih-Jong J Fast invariant pattern search
US7110603B2 (en) * 2003-04-16 2006-09-19 Lee Shih-Jong J Fast invariant matching using template decomposition and synthesis
US7463773B2 (en) * 2003-11-26 2008-12-09 Drvision Technologies Llc Fast high precision matching method
US20050276514A1 (en) * 2004-06-14 2005-12-15 Fisher Paul A Image monitoring method and system
JP2009259190A (en) * 2008-03-28 2009-11-05 Fujitsu Ltd Character recognition program and character recognition device
JP5293321B2 (en) * 2009-03-23 2013-09-18 株式会社豊田中央研究所 Object identification device and program
US20160034756A1 (en) * 2014-07-30 2016-02-04 Lexmark International, Inc. Coarse Document Classification

Family Cites Families (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
DE2012982A1 (en) * 1970-03-18 1971-10-14 Siemens Ag Method for the automatic recognition of alphanumeric characters
NL144071B (en) * 1971-07-22 1974-11-15 Nederlanden Staat DEVICE FOR READING AND COMBINED MACHINE AND MANUAL PROCESSING OF SIGNS.
JPS51118333A (en) * 1975-04-11 1976-10-18 Hitachi Ltd Pattern recognition system
DE3280099D1 (en) * 1981-09-11 1990-03-08 Burroughs Corp GEOMETRIC CHARACTER RECOGNITION WITH DISPLAY OF THE SKELETON AND THE STROKE WIDTH.
US4521909A (en) * 1983-10-04 1985-06-04 Wang Laboratories, Inc. Dual level pattern recognition system
US4589142A (en) * 1983-12-28 1986-05-13 International Business Machines Corp. (Ibm) Method and apparatus for character recognition based upon the frequency of occurrence of said characters

Also Published As

Publication number Publication date
EP0335696A3 (en) 1991-08-21
JPH01246678A (en) 1989-10-02
US4972499A (en) 1990-11-20

Similar Documents

Publication Publication Date Title
EP0335696A2 (en) Pattern recognition apparatus
US6339651B1 (en) Robust identification code recognition system
US6259812B1 (en) Key character extraction and lexicon reduction cursive text recognition
US5067165A (en) Character recognition method
US7120318B2 (en) Automatic document reading system for technical drawings
US6687401B2 (en) Pattern recognizing apparatus and method
US6470091B2 (en) Address reader, sorting machine such as a mail thing and character string recognition method
EP0054439A2 (en) Character segmentation method
US6327385B1 (en) Character segmentation device and character segmentation system
US4975974A (en) Character recognition apparatus
JP2000285190A (en) Form identification method, form identification device, and storage medium
CN117037185B (en) Wire number recognition method and device based on optical character recognition technology
US7072496B2 (en) Slap print segmentation system and method
CN118193652B (en) Asset type identification method for asset recovery management
JPH11328309A (en) Optical character reading method and apparatus
US20040114803A1 (en) Method of stricken-out character recognition in handwritten text
JPH09138837A (en) Form recognition device
JPH09305701A (en) Form recognition method
JPH0749926A (en) Character recognizing device
JP2985813B2 (en) Character string recognition device and knowledge database learning method
JP2993252B2 (en) Homomorphic character discrimination method and apparatus
JPH1011541A (en) Character recognition device
JPH0877293A (en) Character recognition apparatus and method for creating dictionary for character recognition
JPH11126216A (en) Automatic drawing input device
CN119007227A (en) High-precision OCR (optical character recognition) and automatic error correction method in file digitizing process

Legal Events

Date Code Title Description
PUAI Public reference made under article 153(3) epc to a published international application that has entered the european phase

Free format text: ORIGINAL CODE: 0009012

17P Request for examination filed

Effective date: 19890412

AK Designated contracting states

Kind code of ref document: A2

Designated state(s): DE FR GB

PUAL Search report despatched

Free format text: ORIGINAL CODE: 0009013

AK Designated contracting states

Kind code of ref document: A3

Designated state(s): DE FR GB

17Q First examination report despatched

Effective date: 19920706

STAA Information on the status of an ep patent application or granted ep patent

Free format text: STATUS: THE APPLICATION IS DEEMED TO BE WITHDRAWN

18D Application deemed to be withdrawn

Effective date: 19930119