|
| |
| | Agrep - Wikipedia, the free encyclopedia |
 | | agrep (approximate grep) is a fuzzy string searching program, developed by Udi Manber and Sun Wu between 1988 and 1991, for use with the Unix operating system. |  | | There is also another implementation of agrep as a part of the TRE regexp matching package. |  | | It selects the best-suited algorithm for the current query from a variety of the known fastest (built-in) string searching algorithms, including Manber and Wu's bitap algorithm based on Levenshtein distances. |
|
http://en.wikipedia.org/wiki/Agrep
(159 words)
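The bitap approach mentioned in the entry above can be sketched in Python. This is a minimal illustration of the Wu-Manber bit-parallel recurrence under stated assumptions, not agrep's actual code; the function name `bitap_search`, its return convention, and the restriction `0 <= k < len(pattern)` are choices made for this example only.

```python
def bitap_search(text, pattern, k):
    """Return the index of the last character of the earliest-ending
    substring of `text` within `k` edits of `pattern`, or -1 if none.

    Hypothetical helper for illustration; assumes 0 <= k < len(pattern).
    """
    m = len(pattern)
    if not 0 <= k < m:
        raise ValueError("expected 0 <= k < len(pattern)")

    full = (1 << m) - 1      # keeps every state word to m bits
    accept = 1 << (m - 1)    # bit that signals a whole-pattern match

    # masks[ch] has bit i set when pattern[i] == ch
    masks = {}
    for i, ch in enumerate(pattern):
        masks[ch] = masks.get(ch, 0) | (1 << i)

    # state[d], bit i set: pattern[:i+1] matches some suffix of the text
    # read so far with at most d edits
    state = [(1 << d) - 1 for d in range(k + 1)]

    for j, ch in enumerate(text):
        mask = masks.get(ch, 0)
        prev_old = state[0]                      # state[d-1] before this step
        state[0] = ((state[0] << 1) | 1) & mask  # exact shift-and step
        for d in range(1, k + 1):
            old = state[d]
            state[d] = (
                (((old << 1) | 1) & mask)  # characters match
                | prev_old                 # extra character in the text
                | (prev_old << 1)          # substituted character
                | (state[d - 1] << 1)      # pattern character missing from text
                | 1                        # first pattern char costs one edit
            ) & full
            prev_old = old
        if state[k] & accept:
            return j
    return -1
```

With `k = 0` the loop reduces to plain shift-and exact matching; each extra allowed error adds one more machine-word-sized state, which is why the technique suits short patterns.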
|
|
| |
| | Python port of agrep string matching with errors (Version 1.2) |
 | | Given an input pattern and a file, agrep prints on standard output all records that match the pattern (so a single match suffices to have a record printed). |  | | Agrep, written by Sun Wu and Udi Manber (described in "Fast Text Searching Allowing Errors", CACM, 35(10), 1992), is a suite of C functions which together perform various string matching operations under UNIX (i.e. |  | | "agrep", now called sagrep, deals with "short" pattern strings (the limit is settable by a header constant, currently 24), and "a_monkey", now called lagrep, deals with longer strings. |
|
http://www.bio.cam.ac.uk/~mw263/pyagrep.html
(557 words)
|
|
| |
| | A tool to search compressed textual files |
 | | Agrep is used by our software to perform the complicated searches over the dictionary of tokens, so if you wish to substitute it, you should find or implement an equivalent program (in both functionality and interface). |  | | The cgrep and agrep options are numerous, so we refer the user to the manpages of cgrep and agrep. |  | | agrep: a sophisticated pattern-matching utility for uncompressed files proposed by Wu and Manber (1992). |
|
http://roquefort.di.unipi.it/~ferrax/CompressedSearch
(1478 words)
|
|
| |
| | Simple Search Tools Can Save Time |
 | | Although WAIS and agrep both support multi-database queries, it is important to understand that the way data is organized varies considerably across databases. |  | | NAL's genome databases can be queried using a spectrum of tools which range in complexity from WAIS and agrep through "query by example," the "query builder," and the ACEDB query language itself. |  | | WAIS and agrep queries are particularly useful if your search can be expressed as one or a few words or if you want to examine every record (or "object") in a database (the information is in there somewhere, but where?). |
|
http://www.nal.usda.gov/pgdic/Probe/v4n3_4/simple.html
(1091 words)
|
|
| |
| | Glimpse manual page(1) |
 | | e.g., agrep /ftp/pub will match /home/ftp/pub and /ftp/pub/whatever. |  | | Glimpse supports most of agrep's options (agrep is our powerful version of grep) including approximate matching (e.g., finding misspelled words), Boolean queries, and even some limited forms of regular expressions. |  | | For example, glimpse -1 pattern file is the same as agrep -1 pattern file. |
|
http://www.ccsf.edu/Pub/Mj/glimpsehelp.html
(5162 words)
|
|
| |
| | [No title] |
 | | "agrep -B homogenos foo" will generate a message of the form best match has 2 errors, there are 5 matches, output them? |  | | Another example: "agrep -d '$$' pattern foo" will output all paragraphs (separated by an empty line) that contain pattern. |  | | Title: agrep Version: 2.01 Submitted: Feb 23 1992 Author: sw@cs.arizona.edu (Sun Wu) Submittor: sw@cs.arizona.edu (Sun Wu) Description: very fast grep with approximate pattern matching [ I've been using this since I got it, and it's GREAT. |
|
http://www.isc.org/sources/utils/text/searching/agrep-2.01.txt
(358 words)
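The paragraph-oriented output described in the entry above (`-d '$$'`) can be imitated in a few lines of Python. This is only a sketch of the idea of user-defined records, using exact substring matching; the function name `matching_records` and the choice to split on blank lines are assumptions for illustration, and agrep's own delimiter handling is not reproduced here.

```python
import re


def matching_records(text, pattern):
    """Return every blank-line-separated record of `text` containing `pattern`.

    Hypothetical helper: records are paragraphs, matching is exact.
    """
    # One or more blank (or whitespace-only) lines end a record.
    records = re.split(r"\n\s*\n", text)
    return [record for record in records if pattern in record]


sample = "alpha\nbeta\n\ngamma\ndelta\n\nalpha again"
for record in matching_records(sample, "alpha"):
    print(record)
    print("--")
```

A single hit anywhere in a record is enough for the whole record to be reported, which is what makes the search record-oriented rather than line-oriented.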
|
|
| |
| | The TIP Application Suite |
 | | Agrep simply loops through the argument list and informs the file system of the files it will read. |  | | In our benchmark, Agrep searches 1349 kernel source files occupying 2922 disk blocks for a simple string that does not occur in any of the files. |  | | When searching data collections such as software header files or mail messages, hints from Agrep frequently specify hundreds of files too small to benefit from history-based readahead. |
|
http://www.pdl.cmu.edu/TIP/applications.html
(1497 words)
|
|
| |
| | Agrep |
 | | Agrep is a program that allows you to search for ASCII characters in a text or binary file or in a list of files. |  | | The output tells you how many times the text occurs in the file. |
|
http://www.mw-inc.com/agrep.shtml
(37 words)
|
|
| |
| | How I used DNS, a WebServer, and AGREP to create an Organization Website Search Engine |
 | | This is more for a single organizational webring such as the Knights but not all of our members are hosted on the same domain so it’s just a design goal, not a requirement. |  | | Sixth, I used AGREP with this search string and the flat-text file created above to obtain the line number or numbers that match the search string the best |  | | Using a Global DNS Record, an Apache Web Server, PERL, a few simple CGI’s and AGREP, this document will introduce you to our solution for creating an intuitive and easy to navigate Organizational Structure for related Websites with automatic Directory, Fuzzy Logic and Search capabilities for organizations that may involve 5, 50 or 50,000 entities. |
|
http://www.peregrinehw.com/downloads/agrep/webring-howto.html
(1060 words)
|
|
| |
| | AGREP, an approximate GREP |
 | | AGREP is a powerful tool for quickly searching a file or many files for a string or regular expression, with approximate matching capabilities and user-definable records. |  | | AGREP ports to OS/2, DOS, Windows 95 and NT: the first port of AGREP to OS/2 was done by Robert Mahoney (2Rud Software). |  | | Unlike other agrep implementations (like the one by Sun Wu and Udi Manber from University of Arizona available here) TRE agrep allows full regexps of any length, any number of errors, and non-uniform costs for insertion, deletion and substitution. |
|
http://www.tgries.de/agrep
(3306 words)
|
|
| |
| | agrep Details |
 | | agrep is a version of standard UNIX grep with the following |
|
http://www.linspire.com/lindows_products_details.php?product_id=8454
(80 words)
|
|
| |
| | KRUD Linux Package tre-agrep |
 | | The agrep tool searches text input for lines (or records separated by strings matching arbitrary regexps) that contain an approximate, or fuzzy, match to a specified regexp, and prints the matching lines. |  | | Unlike other agrep implementations, TRE agrep allows full POSIX regexps of any length, any number of errors, and non-uniform costs. |  | | Limits can be set on how many errors of each kind are allowed, or only the best matching lines can be output. |
|
http://www.tummy.com/Products/krudserver/packages/tre-agrep.html
(127 words)
|
|
| |
| | GNU Help Wanted Items - News: Free Software agrep replacement found [Savannah] |
 | | I've closed the "agrep replacement" item on the task list and recommend that anyone who was interested in working on that task consider contributing to TRE. |  | | TRE is "a lightweight, robust, and efficient POSIX compliant regexp matching library with some exciting features such as approximate (fuzzy) matching." It's licensed under the terms of the GPL and includes a command-line program that implements agrep functionality. |
|
http://savannah.gnu.org/forum/forum.php?forum_id=3134
(211 words)
|
|
| |
| | Using Fuzzy Searching To Retrieve Plant Genome Information at the USDA, NAL |
 | | Wildcarding (for example: the use of asterisks in some search software) aids the user to a certain degree, but often the results returned are not exactly what is desired. |  | | In addition, because agrep searching involves paging through large text files, you can save time by selecting only the databases of interest. |  | | For example, searching for the pattern "adh" with two mismatches will bring back any object containing an a, d, or h. |
|
http://www.nal.usda.gov/pgdic/Probe/v6n1/fuzzy.html
(512 words)
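The "adh" example in the entry above follows directly from how approximate substring matching counts errors. The sketch below uses a textbook dynamic-programming formulation (not agrep's bit-parallel code) to compute the smallest edit distance between a pattern and any substring of a text; the function name `best_substring_distance` and the sample strings are assumptions made for illustration.

```python
def best_substring_distance(pattern, text):
    """Smallest edit distance between `pattern` and any substring of `text`."""
    # prev[i]: cheapest way to align pattern[:i] with a substring that ends
    # just before the current text position.
    prev = list(range(len(pattern) + 1))
    best = prev[-1]  # aligning against the empty substring
    for ch in text:
        cur = [0]  # a match may start anywhere, so an empty prefix costs 0
        for i, pch in enumerate(pattern, start=1):
            cur.append(min(
                prev[i - 1] + (pch != ch),  # match or substitution
                prev[i] + 1,                # extra character in the text
                cur[i - 1] + 1,             # pattern character missing from text
            ))
        best = min(best, cur[-1])
        prev = cur
    return best


for candidate in ["maize adh1 locus", "zebra", "moon"]:
    print(candidate, best_substring_distance("adh", candidate))
```

A three-letter pattern searched with two allowed errors is satisfied by a single matching letter, since the other two letters can be counted as errors. That is why such a query returns nearly everything.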
|
|
| |
| | Grep - Wikipedia, the free encyclopedia |
 | | There are many derivatives of grep, for example agrep which stands for approximate grep to facilitate fuzzy string searching, fgrep for fixed pattern searches, and egrep for searches involving more sophisticated regular expression syntax. |  | | Tcgrep is a rewrite of grep and uses Perl regular expression syntax. |  | | There are various command line switches available when using grep that modify the default behaviour. |
|
http://en.wikipedia.org/wiki/Grep
(379 words)
|
|
| |
| | REC: Approximation of EU Environmental Law: Country Reports: Hungary: part 2 |
 | | Everyone shall have the right to acquire knowledge about facts and information on the environment, particularly about the state of the environment, the level of environmental pollution, environmental protection activities, as well as the impact of the environment on human health. |  | | The magnitude of the charges shall be established in such a way that they should encourage the users of the environment to reduce the load on and use of the environment." |  | | Commission Directive 67/548 EEC - Classification, packaging and labelling of dangerous substances |
|
http://www.rec.org/REC/Publications/EUlaw/FeeBased/Hungary2.html
(4634 words)
|
|
| |
| | [No title] |
 | | Agrep has other features not found in the grep family: It is record rather than line orientated; pattern matches can be output in a larger context than a single line. |  | | Glimpse uses this to search through all your files in the same way that agrep does. |  | | Its most notable feature is the approximate pattern matching facility. |
|
http://hpux.ee.ualberta.ca/ftp/hpux/Misc/DATABASE
(4841 words)
|
|
| |
| | Searching |
 | | So, given a pattern, they first use agrep to find all the words in the index that match it, and then, using agrep again, they search the corresponding blocks to find the particular matches. |  | | Searching the index for a list of all blocks that may contain a match for the query. |  | | Agrep is similar in use to other greps, but it is much more general. |
|
http://www.cs.technion.ac.il/~zgil/seminar96/search.html
(158 words)
|
|
| |
| | [No title] |
 | | ## If agrep is NULL (default), we want to use fuzzy matching iff ## 'pattern' contains no characters special to regular expressions. |
|
http://www.cs.uiowa.edu/~luke/classes/248/R-1.8.1/src/library/base/R/help.search.R
(333 words)
|
|
| |
| | [No title] |
 | | ;;; ;;; The `agrep' and `agrep-find' commands are interfaces to the ;;; approximate `grep` utility, which is distributed with the `glimpse' ;;; indexing and query tool (available from ;;; |
|
http://ftp.sunet.se/pub/gnu/emacs-lisp/archive/igrep.el
(1836 words)
|
|
| |
| | CLR Catalog - More info on AFGREP |
 | | Limitations: line length 8K; number of strings 30000. The mgrep algorithm in agrep runs significantly faster than other implementations of fgrep I've seen, and I've optimized afgrep a little more to improve its performance compared to agrep. |  | | On a Sparc machine, searching a 76000 line file for the presence of 2000 words, agrep ran in about 55 seconds on average and afgrep ran in about 40 seconds. |  | | The afgrep code was modelled after the agrep code, so it falls under the same copyright. |
|
http://crl.nmsu.edu/cgi-bin/Tools/CLR/clrinfo?AFGREP
(120 words)
|
|
| |
| | The grep family of UNIX utilities (Summer 1995) |
 | | Agrep is Ambiguous grep and allows selection of lines using regular expressions which "almost" match and it allows the user to define what "almost" means. |  | | Since agrep is not commonly shipped with UNIX but must be acquired from the net and built, I will only mention its network home: |  | | This operation (g/re/p) was so commonly performed that its function was abstracted from its editor context and promoted to the grep command. |
|
http://www.uga.edu/~ucns/tti/Computer_Review/Summer95/GREPS.html
(741 words)
|
|
| |
| | AGREP for VMS |
 | | AGREP is a Unix "grep" program but has approximate matching capabilities. |  | | The VMS port release was authorized by the original author in April 96. |
|
http://nucwww.chem.sunysb.edu/info/agrep.htmlx
(88 words)
|
|
| |
| | GLIMPSE: A Tool to Search Through Entire File Systems |
 | | Category: String Comment: Apparently applies agrep's techniques to very large file systems. Also uses an index. Number: TR 93-34 Institution: Department of Computer Science, The University of Arizona Bibtype: TechReport URL: http://glimpse.cs.arizona.edu/ Author: Udi Manber Sun Wu Title: {GLIMPSE}: A Tool to Search Through Entire File Systems Comment1: Recently it seems to be carrying on as WebGlimpse. (2000/1 増井) Year: 1993 Date: 2003/08/01 04:59:50 References: Wu:agrep Crossref: \cite{Wu:agrep} Address: Tucson, Arizona |
|
http://www.pitecan.com/bib/Manber_Glimpse.html
(74 words)
|
|
| |
| | Re: Agrep linecount bug in large files... |
 | | At 05:04 PM 3/29/00 -0600, Wilson Smith wrote: "agrep seems to miscount the line number when searching files larger than the BlockSize / Max_record constants defined in agrep.h -- the line number returned is larger than the number of the actual line which matches..." |
|
http://www.webglimpse.net/wgdev/msg00010.html
(229 words)
|
|
| |
| | agrep - approximate grep tool |
 | | I understand that Thomas Gries implemented agrep for DOS and OS/2. |  | | NAME agrep - search a file for a string or regular expression, with approximate matching capabilities SYNOPSIS agrep [-#cdehiklnpstvwxBDGIS] pattern [-f patternfile] [filename...] |  | | Please note that the first delimiter (sigdashes) is also in the output. |
|
http://www.guckes.net/agrep
(596 words)
|
|
| |
| | [No title] |
 | | 3) I've also added some lisp code to use agrep: AGREP(l) AGREP(l) NAME agrep - search a file for a string or regular expression, with approximate matching capabilities SYNOPSIS agrep [ -#cdehiklnpstvwxBDGIS ] pattern [ -f patternfile ] [ filename... |  | | Chuck> I have been successful with using the Grep tool to search Chuck> for strings within one directory, for example, if I do Chuck> grep -n -e XML * Chuck> while I am in the folder "from/tei" I can turn up all Chuck> strings with "XML". |  | | I can post this code if you're interested. |
|
http://www.mew.org/ml/mew-int-2.0/msg00679.html
(373 words)
|
|
| |
| | Glimpse |
 | | Glimpse (GLobal IMPlicit SEarch) is an indexing and query system that allows one to search through all files quickly. |  | | Glimpse is run by creating an index, usually run every night, of all the files at or below the home directory. |  | | It is very similar to agrep, a powerful grep version and supports most of agrep's options. |
|
http://ils.unc.edu/~marsc/inls283/glimpse.html
(652 words)
|
|
| |
| | Domains with Typographical Errors - A Simple Search Strategy |
 | | In general, such approximate searching is a well-analyzed problem, and a tool has been developed for performing approximate string-matching. |  | | Searches were done for all names in the above typographical errors dataset. |  | | Thus, one need only perform an agrep search over the appropriate domain space in order to generate candidate domain targets. |
|
http://www.sethf.com/domains/typos
(376 words)
|
|
| |
| | GUS Wiki - Wdk Classic |
 | | The class provides the web form necessary to run agrep over a file and returns the result to the user. |  | | If you want to get fancy, you can supply a Perl regular expression to reformat the result in the ResultSub parameter. |
|
http://www.gusdb.org/wiki/index.php/WdkClassic
(341 words)
|
|
| |
| | www.diecorp.com Computers Software File Management Search |
 | | » AGREP for VMS - A Unix "grep" program with approximate matching capabilities, ported to VMS by J.Lauret. |  | | » AGREP, an approximate GREP - Powerful tool for fast approximate fuzzy text searching files for strings, regular expressions. |
|
http://www.diecorp.com/Top/Computers/Software/File_Management/Search
(1321 words)
|
|
| |
| | Domains with Typographical Errors - A Google Search Strategy |
 | | That is, the agrep results might have several variants for the target domain name, but the Google result would have only the relevant match. |  | | This is due to agrep regarding a transposition as two changes in the string (as two letters are different), which makes such changes rank low and exceed the error setting used in the agrep-based results. |  | | In the table below, in an attempt to find the target of a typo'ed domain name, a "hit" was defined as having at least one result from the procedure, while a "miss" meant the procedure had no results. |
|
http://www.sethf.com/domains/typos-google
(620 words)
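The transposition effect described in the entry above can be checked with a plain Levenshtein distance, which allows only insertions, deletions and substitutions. The sketch below is a standard dynamic-programming implementation written for illustration; the function name `levenshtein` and the sample domain-like strings are assumptions, not data from the cited study.

```python
def levenshtein(a, b):
    """Edit distance counting insertions, deletions and substitutions only."""
    # prev[j]: distance between the prefix of `a` processed so far and b[:j]
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        cur = [i]  # turning a[:i] into the empty string takes i deletions
        for j, cb in enumerate(b, start=1):
            cur.append(min(
                prev[j - 1] + (ca != cb),  # match or substitution
                prev[j] + 1,               # delete ca
                cur[j - 1] + 1,            # insert cb
            ))
        prev = cur
    return prev[-1]


print(levenshtein("gogle", "google"))   # one missing letter
print(levenshtein("goolge", "google"))  # two adjacent letters swapped
```

A dropped letter costs one edit, while two swapped neighbours cost two, so a search limited to one error finds the first typo but not the second.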
|
|
| |
| | Performance analysis |
 | | For example, if the data specified by the next read call depends on the data returned by the currently outstanding read call, then speculative execution will not be able to hint the next read call. |  | | The number of read calls is sometimes larger than the number of read blocks because, for example, Agrep issues at least one extra read call per file to detect the end of the file. |  | | During the process of manually modifying an application to issue hints, programmers can make the application more amenable to prefetching by restructuring the code to increase the number of cycles between dependent read calls. |
|
http://www.usenix.org/events/osdi99/full_papers/chang/chang_html/node14.html
(1006 words)
|
|
| |
| | Magic-Dic - The Magic-Dictionary - for Unix/Linux |
 | | Magic-Dic can run in a loop and read out words or text selected with the mouse every time the user marks something. |  | | Users of that operating system can download a Unix shell for Windows: |  | | Magic-Dic is written as a bash script using agrep so that it can be used simply in terminals, because entering a simple command line is ultimately faster and easier than opening a graphical program every time one needs a translation. |
|
http://magic-dic.homeunix.net
(728 words)
|
|
| |
| | AFS NASD running agrep benchmark |
 | | The agrep benchmark requires clients to locate and process files in a multilevel tree. |  | | As the number of client/disk pairs increases, the average time for the NASD configuration remains constant while it increases for SAD, suggesting that the fileserver represents a scalability bottleneck. |  | | NASD clients must obtain capabilities to access each of these files before contacting the drive. |
|
http://www.pdl.cmu.edu/NASD/afs_agrep.html
(58 words)
|
|
| |
| | hakank.blogg: More on the spelling of "Henning Mankell", plus a little about agrep |
 | | Unlike other agrep implementations (like the one by Sun Wu and Udi Manber from University of Arizona available here) TRE agrep allows full regexps of any length, any number of errors, and non-uniform costs for insertion, deletion and substitution. |  | | TRE includes a version of the agrep command line tool for approximate regexp matching in the style of grep. |  | | There is another system for approximate searching: TRE (0.6.6 at the time of writing), which has a version of agrep that looks fun. |
|
http://www.hakank.org/webblogg/archives/000651.html
(877 words)
|
|
| |
| | agrep - Approximate General Regular Expression Pattern Matcher |
 | | The best available tools for approximate pattern matching are glimpse and agrep, developed by Manber and Wu at the University of Arizona, and available from http://glimpse.cs.arizona.edu/. |  | | Glimpse is a tool for building and using an index for searching through file systems, while agrep (approximate general regular expression pattern matcher) is a tool supporting text search with spelling errors. |  | | This page last modified on Oct 13, 1996. |
|
http://www.cs.toronto.edu/~yuana/AlgorithmManual/WEBSITE/IMPLEMEN/AGREP/IMPLEMEN.HTM
(82 words)
|
|
| |
| | gatekeeper.dec.com archive index search form |
 | | You probably want to avoid making queries for single strings like DEC or msdos, as you'll get several MBytes returned. |  | | agrep supports standard regular expressions, plus a boolean and operation `;' and a boolean or operation `,'. |  | | The index is generated nightly and includes last modified dates and file sizes. |
|
http://gatekeeper.dec.com/cgi-bin/grep-index
(126 words)
|
|
| |
| | An old fix to agrep breaks functionality |
 | | Hi, I wanted to upgrade glimpse from version 4.0 to 4.17.4 and noticed that search results are missing when the pattern is long (>30 chars). |  | | After investigating, I traced the bug to agrep choosing a search algorithm limited to 32 characters when a record delimiter is specified on the command line. |
|
http://www.webglimpse.org/wgdev/msg00068.html
(142 words)
|
|
| |
| | ideas/agrep.INST |
 | | Redisplay this file through Enscript (monochrome) or through Vim (coloured), or go back to the list of project files. |  | | agrep 2.01 Sun Wu Udi Manber 1992-05-10 comp.sources.unix, Volume 26, Issues 021-023 (3 parts) 1992-06-18 icule make $INSTALL_PROGRAM agrep /usr/local/bin $INSTALL_DATA agrep.1 /usr/local/man/man1 |
|
http://wdiff.progiciels-bpi.ca/showfile.html?name=ideas/agrep.INST
(63 words)
|
|
| |
| | Citations: personal communication - Hume (ResearchIndex) |
 | | In our experiments, agrep was competitive with the best exact string matching tools that we could find (Hume s gre |  | | Agrep not only supports a large number of options, but it is also very efficient. |  | | Examples of the use of agrep are given in the next section. |
|
http://citeseer.ist.psu.edu/context/11470/0
(313 words)
|
|
| |
| | USENET Cookbook - Metric |
 | | The number field below tells agrep how many errors to allow in approximate pattern matching. |  | | The search output now includes: a recipe name and description along with the actual words in the recipe which matched the search (in []'s under the name). |  | | Searching the USENET Cookbook has recently been improved by the creation of a word index for all the recipes. |
|
http://www.astro.cf.ac.uk/misc/recipe
(152 words)
|
|
| |
| | DAG: agrep Buildlogs |
 | | They may be of use when debugging packaging problems or software issues. |  | | These are the buildlogs of the agrep package. |
|
http://dag.wieers.com/packages/agrep/_buildlogs?M=D
(51 words)
|
|
| |
| | agrep version 2.04 for Mac OSX 10.4 Tiger |
 | | If you haven't already installed Darwinports, you can find easy instructions for doing so at the main Darwinports page. |  | | Once the software has been installed, you can find further information about using agrep with these commands: |  | | Find related portfiles with the new Darwinports.com search feature. |
|
http://agrep.darwinports.com
(259 words)
|
|
| |
| | Various bits of software |
 | | This OCaml library implements the Wu-Manber algorithm for string searching with errors, popularized by the "agrep" Unix command and the "glimpse" file indexing tool. |  | | It was developed as part of a search engine for a largish MP3 collection; the "with error" searching comes in handy for those who can't spell Liszt or Shostakovitch. |  | | More generally, the library promotes a "Lego"-like style of constructing and composing transformations over character streams. |
|
http://cristal.inria.fr/~xleroy/software.html
(780 words)
|
|
| |
| | Search The Fred Frith Discography |
 | | Approximate matching allows you to specify the number of errors (essentially spelling errors) allowed. |  | | Negate Sense of Query (this can lead to lots of hits...) |  | | Best Match Mode (Disabled due to a bug in agrep) |
|
http://www.lib.uchicago.edu/~keith/frith/discography.html
(73 words)
|
|
| |
| | AGREP, an approximate GREP. The Help Page |
 | | To search for string" in all files *.c and to pipe the result into a file x.x, use the following command: |  | | The output redirection part >x.x is moved to the start of the command line to avoid any problems of command line parsing by the operating system and/or compiler specific problems. |  | | The sequence \\\" appears in AGREP as \" (this means: search for "). |
|
http://www.tgries.de/agrep/agrephlp.html
(580 words)
|
|
|