1324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver/* 2324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver * [The "BSD licence"] 3324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver * Copyright (c) 2005-2008 Terence Parr 4324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver * All rights reserved. 5324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver * 6324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver * Conversion to C#: 7324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver * Copyright (c) 2008-2009 Sam Harwell, Pixel Mine, Inc. 8324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver * All rights reserved. 9324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver * 10324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver * Redistribution and use in source and binary forms, with or without 11324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver * modification, are permitted provided that the following conditions 12324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver * are met: 13324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver * 1. Redistributions of source code must retain the above copyright 14324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver * notice, this list of conditions and the following disclaimer. 15324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver * 2. Redistributions in binary form must reproduce the above copyright 16324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver * notice, this list of conditions and the following disclaimer in the 17324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver * documentation and/or other materials provided with the distribution. 18324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver * 3. The name of the author may not be used to endorse or promote products 19324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver * derived from this software without specific prior written permission. 20324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver * 21324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver * THIS SOFTWARE IS PROVIDED BY THE AUTHOR ``AS IS'' AND ANY EXPRESS OR 22324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver * IMPLIED WARRANTIES, INCLUDING, BUT NOT LIMITED TO, THE IMPLIED WARRANTIES 23324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver * OF MERCHANTABILITY AND FITNESS FOR A PARTICULAR PURPOSE ARE DISCLAIMED. 24324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver * IN NO EVENT SHALL THE AUTHOR BE LIABLE FOR ANY DIRECT, INDIRECT, 25324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver * INCIDENTAL, SPECIAL, EXEMPLARY, OR CONSEQUENTIAL DAMAGES (INCLUDING, BUT 26324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver * NOT LIMITED TO, PROCUREMENT OF SUBSTITUTE GOODS OR SERVICES; LOSS OF USE, 27324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver * DATA, OR PROFITS; OR BUSINESS INTERRUPTION) HOWEVER CAUSED AND ON ANY 28324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver * THEORY OF LIABILITY, WHETHER IN CONTRACT, STRICT LIABILITY, OR TORT 29324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver * (INCLUDING NEGLIGENCE OR OTHERWISE) ARISING IN ANY WAY OUT OF THE USE OF 30324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver * THIS SOFTWARE, EVEN IF ADVISED OF THE POSSIBILITY OF SUCH DAMAGE. 31324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver */ 32324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver 33324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruvernamespace Antlr.Runtime { 34324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver using System.Collections.Generic; 35324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver using CLSCompliant = System.CLSCompliantAttribute; 36324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver using ArgumentNullException = System.ArgumentNullException; 37324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver 38324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver /** <summary> 39324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver * The set of fields needed by an abstract recognizer to recognize input 40324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver * and recover from errors etc... As a separate state object, it can be 41324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver * shared among multiple grammars; e.g., when one grammar imports another. 42324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver * </summary> 43324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver * 44324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver * <remarks> 45324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver * These fields are publically visible but the actual state pointer per 46324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver * parser is protected. 47324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver * </remarks> 48324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver */ 49324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver public class RecognizerSharedState { 50324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver /** <summary> 51324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver * Track the set of token types that can follow any rule invocation. 52324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver * Stack grows upwards. When it hits the max, it grows 2x in size 53324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver * and keeps going. 54324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver * </summary> 55324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver */ 56324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver //public List<BitSet> following; 57324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver public BitSet[] following; 58324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver [CLSCompliant(false)] 59324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver public int _fsp; 60324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver 61324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver /** <summary> 62324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver * This is true when we see an error and before having successfully 63324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver * matched a token. Prevents generation of more than one error message 64324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver * per error. 65324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver * </summary> 66324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver */ 67324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver public bool errorRecovery; 68324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver 69324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver /** <summary> 70324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver * The index into the input stream where the last error occurred. 71324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver * This is used to prevent infinite loops where an error is found 72324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver * but no token is consumed during recovery...another error is found, 73324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver * ad naseum. This is a failsafe mechanism to guarantee that at least 74324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver * one token/tree node is consumed for two errors. 75324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver * </summary> 76324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver */ 77324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver public int lastErrorIndex; 78324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver 79324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver /** <summary> 80324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver * In lieu of a return value, this indicates that a rule or token 81324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver * has failed to match. Reset to false upon valid token match. 82324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver * </summary> 83324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver */ 84324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver public bool failed; 85324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver 86324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver /** <summary>Did the recognizer encounter a syntax error? Track how many.</summary> */ 87324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver public int syntaxErrors; 88324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver 89324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver /** <summary> 90324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver * If 0, no backtracking is going on. Safe to exec actions etc... 91324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver * If >0 then it's the level of backtracking. 92324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver * </summary> 93324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver */ 94324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver public int backtracking; 95324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver 96324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver /** <summary> 97324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver * An array[size num rules] of Map<Integer,Integer> that tracks 98324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver * the stop token index for each rule. ruleMemo[ruleIndex] is 99324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver * the memoization table for ruleIndex. For key ruleStartIndex, you 100324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver * get back the stop token for associated rule or MEMO_RULE_FAILED. 101324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver * </summary> 102324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver * 103324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver * <remarks>This is only used if rule memoization is on (which it is by default).</remarks> 104324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver */ 105324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver public IDictionary<int, int>[] ruleMemo; 106324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver 107324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver 108324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver // LEXER FIELDS (must be in same state object to avoid casting 109324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver // constantly in generated code and Lexer object) :( 110324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver 111324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver 112324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver /** <summary> 113324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver * The goal of all lexer rules/methods is to create a token object. 114324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver * This is an instance variable as multiple rules may collaborate to 115324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver * create a single token. nextToken will return this object after 116324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver * matching lexer rule(s). If you subclass to allow multiple token 117324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver * emissions, then set this to the last token to be matched or 118324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver * something nonnull so that the auto token emit mechanism will not 119324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver * emit another token. 120324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver * </summary> 121324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver */ 122324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver public IToken token; 123324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver 124324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver /** <summary> 125324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver * What character index in the stream did the current token start at? 126324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver * Needed, for example, to get the text for current token. Set at 127324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver * the start of nextToken. 128324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver * </summary> 129324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver */ 130324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver public int tokenStartCharIndex; 131324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver 132324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver /** <summary>The line on which the first character of the token resides</summary> */ 133324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver public int tokenStartLine; 134324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver 135324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver /** <summary>The character position of first character within the line</summary> */ 136324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver public int tokenStartCharPositionInLine; 137324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver 138324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver /** <summary>The channel number for the current token</summary> */ 139324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver public int channel; 140324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver 141324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver /** <summary>The token type for the current token</summary> */ 142324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver public int type; 143324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver 144324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver /** <summary> 145324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver * You can set the text for the current token to override what is in 146324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver * the input char buffer. Use setText() or can set this instance var. 147324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver * </summary> 148324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver */ 149324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver public string text; 150324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver 151324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver public RecognizerSharedState() { 152324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver //following = new List<BitSet>( BaseRecognizer.InitialFollowStackSize ); 153324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver following = new BitSet[BaseRecognizer.InitialFollowStackSize]; 154324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver _fsp = -1; 155324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver lastErrorIndex = -1; 156324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver tokenStartCharIndex = -1; 157324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver } 158324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver 159324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver public RecognizerSharedState(RecognizerSharedState state) { 160324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver if (state == null) 161324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver throw new ArgumentNullException("state"); 162324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver 163324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver following = (BitSet[])state.following.Clone(); 164324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver _fsp = state._fsp; 165324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver errorRecovery = state.errorRecovery; 166324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver lastErrorIndex = state.lastErrorIndex; 167324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver failed = state.failed; 168324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver syntaxErrors = state.syntaxErrors; 169324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver backtracking = state.backtracking; 170324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver 171324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver if (state.ruleMemo != null) 172324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver ruleMemo = (IDictionary<int, int>[])state.ruleMemo.Clone(); 173324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver 174324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver token = state.token; 175324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver tokenStartCharIndex = state.tokenStartCharIndex; 176324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver tokenStartCharPositionInLine = state.tokenStartCharPositionInLine; 177324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver channel = state.channel; 178324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver type = state.type; 179324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver text = state.text; 180324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver } 181324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver } 182324c4644fee44b9898524c09511bd33c3f12e2dfBen Gruver} 183