1324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver/* 2324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver * [The "BSD licence"] 3324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver * Copyright (c) 2005-2008 Terence Parr 4324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver * All rights reserved. 5324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver * 6324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver * Conversion to C#: 7324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver * Copyright (c) 2008-2009 Sam Harwell, Pixel Mine, Inc. 8324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver * All rights reserved. 9324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver * 10324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver * Redistribution and use in source and binary forms, with or without 11324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver * modification, are permitted provided that the following conditions 12324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver * are met: 13324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver * 1. Redistributions of source code must retain the above copyright 14324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver * notice, this list of conditions and the following disclaimer. 15324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver * 2. Redistributions in binary form must reproduce the above copyright 16324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver * notice, this list of conditions and the following disclaimer in the 17324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver * documentation and/or other materials provided with the distribution. 18324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver * 3. The name of the author may not be used to endorse or promote products 19324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver * derived from this software without specific prior written permission. 20324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver * 21324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver * THIS SOFTWARE IS PROVIDED BY THE AUTHOR ``AS IS'' AND ANY EXPRESS OR 22324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver * IMPLIED WARRANTIES, INCLUDING, BUT NOT LIMITED TO, THE IMPLIED WARRANTIES 23324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver * OF MERCHANTABILITY AND FITNESS FOR A PARTICULAR PURPOSE ARE DISCLAIMED. 24324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver * IN NO EVENT SHALL THE AUTHOR BE LIABLE FOR ANY DIRECT, INDIRECT, 25324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver * INCIDENTAL, SPECIAL, EXEMPLARY, OR CONSEQUENTIAL DAMAGES (INCLUDING, BUT 26324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver * NOT LIMITED TO, PROCUREMENT OF SUBSTITUTE GOODS OR SERVICES; LOSS OF USE, 27324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver * DATA, OR PROFITS; OR BUSINESS INTERRUPTION) HOWEVER CAUSED AND ON ANY 28324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver * THEORY OF LIABILITY, WHETHER IN CONTRACT, STRICT LIABILITY, OR TORT 29324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver * (INCLUDING NEGLIGENCE OR OTHERWISE) ARISING IN ANY WAY OUT OF THE USE OF 30324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver * THIS SOFTWARE, EVEN IF ADVISED OF THE POSSIBILITY OF SUCH DAMAGE. 31324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver */ 32324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver 33324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruvernamespace Antlr.Runtime 34324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver{ 35324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver using System.Collections.Generic; 36324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver using CLSCompliant = System.CLSCompliantAttribute; 37324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver using ArgumentNullException = System.ArgumentNullException; 38324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver 39324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver /** <summary> 40324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver * The set of fields needed by an abstract recognizer to recognize input 41324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver * and recover from errors etc... As a separate state object, it can be 42324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver * shared among multiple grammars; e.g., when one grammar imports another. 43324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver * </summary> 44324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver * 45324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver * <remarks> 46324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver * These fields are publically visible but the actual state pointer per 47324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver * parser is protected. 48324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver * </remarks> 49324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver */ 50324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver public class RecognizerSharedState 51324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver { 52324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver /** <summary> 53324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver * Track the set of token types that can follow any rule invocation. 54324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver * Stack grows upwards. When it hits the max, it grows 2x in size 55324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver * and keeps going. 56324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver * </summary> 57324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver */ 58324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver //public List<BitSet> following; 59324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver public BitSet[] following; 60324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver [CLSCompliant( false )] 61324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver public int _fsp; 62324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver 63324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver /** <summary> 64324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver * This is true when we see an error and before having successfully 65324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver * matched a token. Prevents generation of more than one error message 66324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver * per error. 67324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver * </summary> 68324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver */ 69324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver public bool errorRecovery; 70324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver 71324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver /** <summary> 72324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver * The index into the input stream where the last error occurred. 73324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver * This is used to prevent infinite loops where an error is found 74324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver * but no token is consumed during recovery...another error is found, 75324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver * ad naseum. This is a failsafe mechanism to guarantee that at least 76324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver * one token/tree node is consumed for two errors. 77324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver * </summary> 78324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver */ 79324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver public int lastErrorIndex; 80324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver 81324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver /** <summary> 82324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver * In lieu of a return value, this indicates that a rule or token 83324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver * has failed to match. Reset to false upon valid token match. 84324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver * </summary> 85324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver */ 86324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver public bool failed; 87324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver 88324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver /** <summary>Did the recognizer encounter a syntax error? Track how many.</summary> */ 89324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver public int syntaxErrors; 90324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver 91324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver /** <summary> 92324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver * If 0, no backtracking is going on. Safe to exec actions etc... 93324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver * If >0 then it's the level of backtracking. 94324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver * </summary> 95324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver */ 96324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver public int backtracking; 97324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver 98324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver /** <summary> 99324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver * An array[size num rules] of Map<Integer,Integer> that tracks 100324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver * the stop token index for each rule. ruleMemo[ruleIndex] is 101324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver * the memoization table for ruleIndex. For key ruleStartIndex, you 102324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver * get back the stop token for associated rule or MEMO_RULE_FAILED. 103324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver * </summary> 104324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver * 105324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver * <remarks>This is only used if rule memoization is on (which it is by default).</remarks> 106324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver */ 107324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver public IDictionary<int, int>[] ruleMemo; 108324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver 109324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver 110324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver // LEXER FIELDS (must be in same state object to avoid casting 111324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver // constantly in generated code and Lexer object) :( 112324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver 113324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver 114324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver /** <summary> 115324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver * The goal of all lexer rules/methods is to create a token object. 116324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver * This is an instance variable as multiple rules may collaborate to 117324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver * create a single token. nextToken will return this object after 118324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver * matching lexer rule(s). If you subclass to allow multiple token 119324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver * emissions, then set this to the last token to be matched or 120324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver * something nonnull so that the auto token emit mechanism will not 121324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver * emit another token. 122324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver * </summary> 123324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver */ 124324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver public IToken token; 125324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver 126324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver /** <summary> 127324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver * What character index in the stream did the current token start at? 128324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver * Needed, for example, to get the text for current token. Set at 129324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver * the start of nextToken. 130324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver * </summary> 131324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver */ 132324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver public int tokenStartCharIndex; 133324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver 134324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver /** <summary>The line on which the first character of the token resides</summary> */ 135324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver public int tokenStartLine; 136324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver 137324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver /** <summary>The character position of first character within the line</summary> */ 138324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver public int tokenStartCharPositionInLine; 139324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver 140324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver /** <summary>The channel number for the current token</summary> */ 141324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver public int channel; 142324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver 143324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver /** <summary>The token type for the current token</summary> */ 144324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver public int type; 145324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver 146324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver /** <summary> 147324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver * You can set the text for the current token to override what is in 148324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver * the input char buffer. Use setText() or can set this instance var. 149324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver * </summary> 150324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver */ 151324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver public string text; 152324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver 153324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver public RecognizerSharedState() 154324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver { 155324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver //following = new List<BitSet>( BaseRecognizer.InitialFollowStackSize ); 156324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver following = new BitSet[BaseRecognizer.InitialFollowStackSize]; 157324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver _fsp = -1; 158324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver lastErrorIndex = -1; 159324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver tokenStartCharIndex = -1; 160324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver } 161324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver 162324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver public RecognizerSharedState( RecognizerSharedState state ) 163324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver { 164324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver if (state == null) 165324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver throw new ArgumentNullException("state"); 166324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver 167324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver following = (BitSet[])state.following.Clone(); 168324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver _fsp = state._fsp; 169324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver errorRecovery = state.errorRecovery; 170324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver lastErrorIndex = state.lastErrorIndex; 171324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver failed = state.failed; 172324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver syntaxErrors = state.syntaxErrors; 173324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver backtracking = state.backtracking; 174324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver 175324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver if ( state.ruleMemo != null ) 176324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver ruleMemo = (IDictionary<int, int>[])state.ruleMemo.Clone(); 177324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver 178324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver token = state.token; 179324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver tokenStartCharIndex = state.tokenStartCharIndex; 180324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver tokenStartCharPositionInLine = state.tokenStartCharPositionInLine; 181324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver channel = state.channel; 182324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver type = state.type; 183324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver text = state.text; 184324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver } 185324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver } 186324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver} 187