Backup_Repos/stash

mirror of https://github.com/stashapp/stash.git synced 2026-05-05 19:10:27 +02:00

Author	SHA1	Message	Date
WithoutPants	f69bd8a94f	Restructure go project (#2356 ) * Move main to cmd * Move api to internal * Move logger and manager to internal * Move shell hiding code to separate package * Decouple job from desktop and utils * Decouple session from config * Move static into internal * Decouple config from dlna * Move desktop to internal * Move dlna to internal * Decouple remaining packages from config * Move config into internal * Move jsonschema and paths to models * Make ffmpeg functions private * Move file utility methods into fsutil package * Move symwalk into fsutil * Move single-use util functions into client package * Move slice functions to separate packages * Add env var to suppress windowsgui arg * Move hash functions into separate package * Move identify to internal * Move autotag to internal * Touch UI when generating backend	2022-03-17 11:33:59 +11:00
WithoutPants	9e3d56b22f	Fix identify and script scraper bugs (#2375 ) * Continue identify if source fails * Handle empty result set correctly * Parse null values from scraper script correctly * Omit warning when json selector value missing * Return nil when scraped item not found * Fix graphql validation errors	2022-03-15 09:42:22 +11:00
WithoutPants	d88515abcd	Autotag optimisation (#2368 ) * Add duration to autotag finish message * No sorting scene/image/gallery where not specified * Use an LRU cache for sqlite regexp function * Compile path separator regex once * Cache objects with single letter first names * Move finished auto-tag log * Add more verbose logging * Add new changelog	2022-03-09 12:01:56 +11:00
WithoutPants	d7473f4b38	Distance match phashes on bulk stash-box query (#2355 )	2022-03-03 09:38:37 +11:00
Releck	22321c2b62	Fix performer tags not applying on scene scrapers (#2339 )	2022-02-22 10:18:29 +11:00
kermieisinthehouse	0e514183a7	Desktop integration (#2073 ) * Open stash in system tray on Windows/MacOS * Add desktop notifications * MacOS Bundling * Add binary icon Co-authored-by: WithoutPants <53250216+WithoutPants@users.noreply.github.com>	2022-02-03 11:20:34 +11:00
InfiniteTF	a3c20ce8da	Add support for submitting performer/scene drafts to stash-box (#2234 ) * Add support for submitting performer/scene drafts to stash-box Co-authored-by: Kermie <kermie@isinthe.house>	2022-02-01 15:06:51 +11:00
bnkai	be5dc7e545	Resolve hostname for chromium RDP requests (#2174 )	2022-01-04 15:47:39 +11:00
InfiniteTF	34aea876e8	Add stash-box credentials validation (#2173 )	2022-01-04 14:20:31 +11:00
InfiniteTF	bd784cdf96	Fix conversion of multi word stash-box enums (#2191 )	2022-01-04 12:55:45 +11:00
bnkai	66dd239732	Skip cleaning for search by name scrape queries (#2059 ) * Skip pp for search by name queries * upgrade htmlquery	2021-12-16 11:18:39 +11:00
InfiniteTF	f3ab6578d9	Add performer aliases to stash-box tagging/scraping (#2091 ) * Add performer aliases to stash-box tagging/scraping	2021-12-08 09:36:06 +11:00
SmallCoccinelle	cf4ab843f6	Fix setting images (#2068 ) When postprocessing, pass the images by reference rather than value, so we get the Image fields populated correctly in the output.	2021-11-29 14:54:01 +11:00
SmallCoccinelle	4089fcf1e2	Scraper refactor middle (#2043 ) * Push scrapeByURL into scrapers Replace ScrapePerfomerByURL, ScrapeMovie..., ... with ScrapeByURL in the scraperActionImpl interface. This allows us to delete a lot of repeated code in the scrapers and replace the central part with a switch on the scraper type. * Fold name scraping into one call Follow up on scraper refactoring. Name scrapers use the same code path. This allows us to restructure some code and kill some functions, adding variance to the name scraping code. It allows us to remove some code repetition as well. * Do not export loop refs. * Simplify fragment scraping Generalize fragment scrapers into ScrapeByFragment. This simplifies fragment code flows into a simpler pathing which should be easier to handle in the future. * Eliminate more context.TODO() In a number of cases, we have a context now. Use the context rather than TODO() for those cases in order to make those operations cancellable. * Pass the context for the stashbox scraper This removes all context.TODO() in the path of the stashbox scraper, and replaces it with the context that's present on each of the paths. * Pass the context into subscrapers Mostly a mechanical update, where we pass in the context for subscraping. This removes the final context.TODO() in the scraper code. * Warn on unknown fields from scripts A common mistake for new script writers are that they return fields not known to stash. For instance the name "description" is used rather than "details". Decode disallowing unknown fields. If this fails, use a tee-reader to fall back to the old behavior, but print a warning for the user in this case. Thus, we retain the old behavior, but print warnings for scripts which fails the more strict unknown-fields detection. * Nil-check before running the postprocessing chain Fixes panics when scraping returns nil values. * Lift nil-ness in post-postprocessing If the struct we are trying to post-process is nil, we shouldn't enter the postprocessing flow at all. Pass the struct as a value rather than a pointer, eliminating nil-checks as we go. Use the top-level postProcess call to make the nil-check and then abort there if the object we are looking at is nil. * Allow conversion routines to handle values If we have a non-pointer type in the interface, we should also convert those into ScrapedContent. Otherwise we get errors on deprecated functions.	2021-11-26 11:20:06 +11:00
WithoutPants	19e69f5310	Prefer studio name later in filename (#2057 )	2021-11-26 08:29:25 +11:00
SmallCoccinelle	c1f89611e2	Refactor scraper top half (#1893 ) * Simplify scraper listing Introduce an enum, scraper.Kind, which explains what we are looking for. Make it possible to match this from a scraper struct. Use the enum to rewrite all the listing code to use the same code path. * Use a map, nitpick ScrapePerformerList Let the cache store a map from ID of a scraper to the scraper. This improves lookups when there are many scrapers, making it practically O(1) rather than O(n). If many scrapers are stored, this is faster. Since range expressions work unchanged, we don't have to change much, and things will still work. make Kind a Stringer Rename ScraperPerformerList -> ScraperPerformerQuery since that name is used in the other scrapers, and we value consistency. Tune ScraperPerformerQuery: * Return static errors * Use the new functionality * When loading scrapers, do so directly Rather than first walking the directory structure to obtain file paths, fold the load directly in the the filepath walk. This makes the code for more direct. * Use static ErrNotFound If a scraper isn't found, return one static error. This paves the way for eventually doing our own error-presenter in gqlgen. * Store the cache in the Resolver state Putting the scraperCache directly in the resolver avoids the need to call manager.GetInstance() all over the place to get access to the scraper cache. The cache is stored by pointer, so it should be safe, since the cache will just update its internal state rather than being overwritten. We can now utilize the resolver state to grab the cache where needed. While here, pass context.Context from the resolver down into a function, which removes a context.TODO() * Introduce ScrapedContent Create a union in the GraphQL schema for all scraped content. This simplifies the internal implementation because we get variance on the output content type. Introduce a new type ScrapedContentType which signifies the scraped content you want as a caller. Use these to generalize the List interface and the URL scraping interface. * Simplify the scraper API Introduce a new interface for scraping. This interface is then used in the upper half of the scraper code, to make the code use one code flow rather than multiple code flows. Variance is currently at the old scraper structure. Add extending interfaces for the different ways of invoking scrapes. Use interface conversions to convert a scraper from the cache to a scraper supporting the extra methods. The return path returns models.ScrapedContent. Write a general postProcess function in the scraper, handling all ScrapedContent via type switching. This consolidates all postprocessing code flows. Introduce marhsallers in the resolver code for converting ScrapedContent into the underlying concrete types. Use this to plug the existing fields in the Query resolver, so everything still works. * ScrapedContent: add more marshalling functions Handle all marshalling of ScrapedContent through marhsalling functions. Removes some hand-rolled early variants of it, and replaces it with a canonical code flow. * Support loadByName via scraper_s In order to temporarily plug a hole in the current implementation, we use the older implementation as a hook to get the newer implementation to run. Later on, this can serve as a guide for how to implement the lower level bits inside the scrapers themselves. For now, it just enables support. * Plug the remaining scraper functions for now Since we would like to have a scraper which works in between refactors, plug the lower level parts of the scraper for now. It avoids us having to tackle this part just yet. * Move postprocessing to its own file There's enough postprocessing to clutter the main scrapers.go file. Move all of this into a new file, postprocessing to make the API simpler. It now lives in scrapers.go. * Scraper: Invoke API consistency scraper.Cache.ScrapeByName -> ScrapeName * Fix scraping scenes by URL Simple typo. While here, also make a single marshaller nil-aware. * Introduce scraper groups, consolidate loadByURL Rename `scraper_s` into `group`. A group is a group of scrapers with the same identity. This corresponds to a single YAML file for a scraper configuration. It defines a group which supports different types of scraping contexts. Move config into the group, and lift txnManager and globalConfig to the group. Because we now return models.ScrapedContent we can use interfaces to get variance from the different underlying scrapers. Use a type switch for the URL matcher candidates. And then again for the scrapers. This consolidates all URL scraping paths into one. While here, remove the urlMatcher interface which isn't needed. Also clean up the remaining interfaces for url scraping and delete code which has no purpose anymore. * Consolidate fragment scraping in one code path While here, abide the linters checks. * Refactor loadByFragment Give it the same treatment as loadByURL: Step 1: find a scraperActionImpl which works for the data. Step 2: use that to scrape Most of this is simple analysis on the data at hand. It can be pushed down further in a later commit, but for now we leave it here. * Remove configScraper, autotag is a scraper Remove the remains of the configScraper struct. It now lives on in the group struct. Kill the remaining interfaces from the old implementation while here. Remove group.specification since it can now be handled by a simple func call to spec(). Work through the autotag scraper. It now implements the scraper interface, so it can be used as a scraper. This also simplifies the autotag scraper quite a bit since it doens't have to implement a number of unsupported func calls. * Simplify the fragment scraper flow * Pass the context Eliminate a round of context.TODO() in the scraper code by passing the calling context down into the subsystem. This will gracefully allow for termination of remote calls if the client goes away for some reason in GraphQL requests. * Improve listScrapers in the schema Support lists of types we accept. * Be graceful on nil values in conversion Supporting nil-values make the API more robust in the case of partial results in a multi-scrape situation. * Improve listScrapers: output at-most-once Use the ID of a scraper to reduce the output set. If a scraper has been included, don't include it again. * Consolidate all API level errors into resolver.go * Reorder files and functions: scrapers.go -> cache.go: It almost contains nothing but the cache code. Move errors into scraper.go from here because It is a better place to have them living right now group.go: All of the group structure. This can now go from scraper.go, making it more lean. Move group create from config_scraper to here. config.go: Move the `(c config) spec()` call to here. config_scraper.go: Empty file by now * Name-update the scraper interfaces Use 'via' rather than 'loadBy'. The scrape happens via a given scrape method, so I think this is a nice name for it. * Rename scrapers for consistency. While here, improve the error formatting, so different errors come back differently. * Nuke the freeones field from the GraphQL schema * Fix autotag interfacing, refactor The autotag scraper uses a pointer receiver, but the rest of the code we use for scraping doesn't expect a pointer-receiver. Hence, to fix the autotag scraper, we change it to be a value receiver, like the rest of the code. Fix: viaScene, and viaGallery. While here, remove a couple of pointer-receiver methods which can be trivially rewritten into plain functions. * Protect against pointer interfaces The underlying code can be a bit inconsistent in what it returns. Introduce pointer-types in the postprocessing layer and handle them accordingly for now. Once a better understanding of the lower levels are understood, we can lift this. * Move ErrConversion into the models package. The conversion error pertains to the logic of converting models. Because of this, it should move there, so it is centralized. * Be consistent in scraper resolver error handling If we have a static error Err = errors.New(..) Then use it wrapped at the start: fmt.Errorf("%w: ...context...", Err) This reads better. While here, avoid using the underlying Atoi errors: they are verbose, and like 99% of the time, the user know what is wrong from the input string, so just give that back. Also, remove the scraper id from the error contexts: it is implicit, and the error wouldn't change if we used a different scraper, which the error message would imply. * Mark the listScrapers() API as deprecated The same functionality is now present in listScrapers. Improve error formatting Think about how each error is going to be used and tweak them to be nicer. * Return a sorted list of scrapers This helps testing, it's closer to what we had, caches like stable data, and it is easier for humans. It also makes the output stable, because map iteration is randomized. * Fix listScrapers calls to return in ID-order Since we need the ordering to be by ID in all situations, it is easier to just generalize the cache listScrapers call to support multiple scraper types. This avoids a de-dupe map up the chain, since every scraper is only considered once. Sorting now happens in the cache listScrapers call. Use this generalized function in all resolvers, which are now simple passthroughs. * Remove UpdateConfig from the scraper cache. This isn't needed, so get rid of it. * Pull a context into identify Scraping scenes in the identify tasks now use a context from up the call chain. * Do not store the scraper cache in the resolver. Scraper caches are updated through manager.singleton•RefreshScraperCache, so we can't keep a pointer to it in the resolver. Instead, solve this by adding a fetcher method to the resolver type. This keeps it local to the resolver, while handling the problem of updating caches in the configuration.	2021-11-19 10:55:34 +11:00
InfiniteTF	c571687c99	Resolve performer/studio stashIDs for scraped scenes (#2006 ) * Resolve performer/studio stashIDs for scraped scenes * Check endpoint when matching stashids	2021-11-15 07:51:52 +11:00
InfiniteTF	808202ba8a	Fix performer tagger field updating (#1977 ) * Fix performer tagger field updating	2021-11-11 11:34:46 +11:00
WithoutPants	0f64954e5b	Identify task (#1839 ) * Add identify task * Change type naming * Debounce folder select text input * Add generic slice comparison function	2021-10-28 14:25:17 +11:00
InfiniteTF	15acf91b90	Add PHash distance matching to stash-box integration (#1858 ) * Add PHash distance matching to stash-box integration	2021-10-20 17:22:25 +11:00
SmallCoccinelle	e513b6ffa5	Cache and reuse the scraper HTTP client (#1855 ) * Add Cookies directly to the request Rather than maintaining a cookie jar on a one-shot HTTP client, maintain the jar ourselves: make a new jar, then use it to select the right cookies. The cookies are set on the request rather than on the client. This will retain the current behavior as we are always throwing the client away after each use. This patch enables the lifting of the http client as well over time. * Introduce a cached scraper HTTP client The scraper cache is augmented with an http.Client. These are safe for concurrent use, so the pointer can safely be passed around. Push this into scraper configurations where applicable, next to the txnManagers. When we issue a loadUrl request, do so on the cached http.Client, which will reuse existing idle connections in the client if any are present. * Set MaxIdleConnsPerHost. Closes #1850 We allow for up to 8 idle connections to a single host. This should make concurrent operation toward the same host reuse connections, even for sizeable concurrency. The number isn't bumped excessively high. We should probably limit concurrency toward a single site anyway, since we'll be able to overrun a site with queries quite easily if we have many concurrent goroutines issuing requests at the same time. * Reinstate driverOptions / useCDP check Use DeMorgan's laws to invert the logic and exit early. Fixes tests breaking. * Documentation fixup. * Use the scraper http.Client when fetching images Fold image fetchers onto the cached scraper http.Client as well. This makes the scraper have a single http.Client cache for all its operations. Thread the client upwards to the relevant attachment points: either the cache, or a stash_box instance, which is extended to include a pointer to the client. Style roughly follows that of txnManagers. * Use the same http Client as the GraphQL client use Rather than using http.DefaultClient, use the same client as the GraphQL client use in the stash_box subsystem. This localizes the client used in the subsystem into the constructing New.. call. * Hoist HTTP client construction Create a function for initializaing the HTTP Client we use. While here hoist magic numbers into constants. Introduce a proper static redirect error and use it in the client code as well. * Reinstate printCookies This is a debugging function, and it might still come in handy in the future at some point. * Nitpick comment. * Minor tidy Co-authored-by: WithoutPants <53250216+WithoutPants@users.noreply.github.com>	2021-10-20 16:12:24 +11:00
SmallCoccinelle	e14bb8432c	Enable gocritic (#1848 ) * Don't capitalize local variables ValidCodecs -> validCodecs * Capitalize deprecation markers A deprecated marker should be capitalized. * Use re.MustCompile for static regexes If the regex fails to compile, it's a programmer error, and should be treated as such. The regex is entirely static. * Simplify else-if constructions Rewrite else { if cond {}} to else if cond {} * Use a switch statement to analyze formats Break an if-else chain. While here, simplify code flow. Also introduce a proper static error for unsupported image formats, paving the way for being able to check against the error. * Rewrite ifElse chains into switch statements The "Effective Go" https://golang.org/doc/effective_go#switch document mentions it is more idiomatic to write if-else chains as switches when it is possible. Find all the plain rewrite occurrences in the code base and rewrite. In some cases, the if-else chains are replaced by a switch scrutinizer. That is, the code sequence if x == 1 { .. } else if x == 2 { .. } else if x == 3 { ... } can be rewritten into switch x { case 1: .. case 2: .. case 3: .. } which is clearer for the compiler: it can decide if the switch is better served by a jump-table then a branch-chain. * Rewrite switches, introduce static errors Introduce two new static errors: * `ErrNotImplmented` * `ErrNotSupported` And use these rather than forming new generative errors whenever the code is called. Code can now test on the errors (since they are static and the pointers to them wont change). Also rewrite ifElse chains into switches in this part of the code base. * Introduce a StashBoxError in configuration Since all stashbox errors are the same, treat them as such in the code base. While here, rewrite an ifElse chain. In the future, it might be beneifical to refactor configuration errors into one error which can handle missing fields, which context the error occurs in and so on. But for now, try to get an overview of the error categories by hoisting them into static errors. * Get rid of an else-block in transaction handling If we succesfully `recover()`, we then always `panic()`. This means the rest of the code is not reachable, so we can avoid having an else-block here. It also solves an ifElse-chain style check in the code base. * Use strings.ReplaceAll Rewrite strings.Replace(s, o, n, -1) into strings.ReplaceAll(s, o, n) To make it consistent and clear that we are doing an all-replace in the string rather than replacing parts of it. It's more of a nitpick since there are no implementation differences: the stdlib implementation is just to supply -1. * Rewrite via gocritic's assignOp Statements of the form x = x + e is rewritten into x += e where applicable. * Formatting * Review comments handled Stash-box is a proper noun. Rewrite a switch into an if-chain which returns on the first error encountered. * Use context.TODO() over context.Background() Patch in the same vein as everything else: use the TODO() marker so we can search for it later and link it into the context tree/tentacle once it reaches down to this level in the code base. * Tell the linter to ignore a section in manager_tasks.go The section is less readable, so mark it with a nolint for now. Because the rewrite enables a ifElseChain, also mark that as nolint for now. * Use strings.ReplaceAll over strings.Replace * Apply an ifElse rewrite else { if .. { .. } } rewrite into else if { .. } * Use switch-statements over ifElseChains Rewrite chains of if-else into switch statements. Where applicable, add an early nil-guard to simplify case analysis. Also, in ScanTask's Start(..), invert the logic to outdent the whole block, and help the reader: if it's not a scene, the function flow is now far more local to the top of the function, and it's clear that the rest of the function has to do with scene management. * Enable gocritic on the code base. Disable appendAssign for now since we aren't passing that check yet. * Document the nolint additions * Document StashBoxBatchPerformerTagInput	2021-10-18 14:12:40 +11:00
kermieisinthehouse	5ec70ac3e0	Fix List filter styles, fix freeones spam (#1853 ) * Fix List filter styles, fix freeones spam	2021-10-15 14:02:49 +11:00
SmallCoccinelle	655d3ae969	Toward better context handling (#1835 ) * Use the request context The code uses context.Background() in a flow where there is a http.Request. Use the requests context instead. * Use a true context in the plugin example Let AddTag/RemoveTag take a context and use that context throughout the example. * Avoid the use of context.Background Prefer context.TODO over context.Background deep in the call chain. This marks the site as something which we need to context-handle later, and also makes it clear to the reader that the context is sort-of temporary in the code base. While here, be consistent in handling the `act` variable in each branch of the if .. { .. } .. check. * Prefer context.TODO over context.Background For the different scraping operations here, there is a context higher up the call chain, which we ought to use. Mark the call-sites as TODO for now, so we can come back later on a sweep of which parts can be context-lifted. * Thread context upwards Initialization requires context for transactions. Thread the context upward the call chain. At the intialization call, add a context.TODO since we can't break this yet. The singleton assumption prevents us from pulling it up into main for now. * make tasks context-aware Change the task interface to understand contexts. Pass the context down in some of the branches where it is needed. * Make QueryStashBoxScene context-aware This call naturally sits inside the request-context. Use it. * Introduce a context in the JS plugin code This allows us to use a context for HTTP calls inside the system. Mark the context with a TODO at top level for now. * Nitpick error formatting Use %v rather than %s for error interfaces. Do not begin an error strong with a capital letter. * Avoid the use of http.Get in FFMPEG download chain Since http.Get has no context, it isn't possible to break out or have policy induced. The call will block until the GET completes. Rewrite to use a http Request and provide a context. Thread the context through the call chain for now. provide context.TODO() at the top level of the initialization chain. * Make getRemoteCDPWSAddress aware of contexts Eliminate a call to http.Get and replace it with a context-aware variant. Push the context upwards in the call chain, but plug it before the scraper interface so we don't have to rewrite said interface yet. Plugged with context.TODO() * Scraper: make the getImage function context-aware Use a context, and pass it upwards. Plug it with context.TODO() up the chain before the rewrite gets too much out of hand for now. Minor tweaks along the way, remove a call to context.Background() deep in the call chain. * Make NOTIFY request context-aware The call sits inside a Request-handler. So it's natural to use the requests context as the context for the outgoing HTTP request. * Use a context in the url scraper code We are sitting in code which has a context, so utilize it for the request as well. * Use a context when checking versions When we check the version of stash on Github, use a context. Thread the context up to the initialization routine of the HTTP/GraphQL server and plug it with a context.TODO() for now. This paves the way for providing a context to the HTTP server code in a future patch. * Make utils func ReadImage context-aware In almost all of the cases, there is a context in the call chain which is a natural use. This is true for all the GraphQL mutations. The exception is in task_stash_box_tag, so plug that task with context.TODO() for now. * Make stash-box get context-aware Thread a context through the call chain until we hit the Client API. Plug it with context.TODO() there for now. * Enable the noctx linter The code is now free of any uncontexted HTTP request. This means we pass the noctx linter, and we can enable it in the code base.	2021-10-14 15:32:41 +11:00
SmallCoccinelle	c6f6205e4f	Errorlint sweep + minor linter tweaks (#1796 ) * Replace error assertions with Go 1.13 style Use `errors.As(..)` over type assertions. This enables better use of wrapped errors in the future, and lets us pass some errorlint checks in the process. The rewrite is entirely mechanical, and uses a standard idiom for doing so. * Use Go 1.13's errors.Is(..) Rather than directly checking for error equality, use errors.Is(..). This protects against error wrapping issues in the future. Even though something like sql.ErrNoRows doesn't need the wrapping, do so anyway, for the sake of consistency throughout the code base. The change almost lets us pass the `errorlint` Go checker except for a missing case in `js.go` which is to be handled separately; it isn't mechanical, like these changes are. * Remove goconst goconst isn't a useful linter in many cases, because it's false positive rate is high. It's 100% for the current code base. * Avoid direct comparison of errors in recover() Assert that we are catching an error from recover(). If we are, check that the error caught matches errStop. * Enable the "errorlint" checker Configure the checker to avoid checking for errorf wraps. These are often false positives since the suggestion is to blanket wrap errors with %w, and that exposes the underlying API which you might not want to do. The other warnings are good however, and with the current patch stack, the code base passes all these checks as well. * Configure rowserrcheck The project uses sqlx. Configure rowserrcheck to include said package. * Mechanically rewrite a large set of errors Mechanically search for errors that look like fmt.Errorf("...%s", err.Error()) and rewrite those into fmt.Errorf("...%v", err) The `fmt` package is error-aware and knows how to call err.Error() itself. The rationale is that this is more idiomatic Go; it paves the way for using error wrapping later with %w in some sites. This patch only addresses the entirely mechanical rewriting caught by a project-side search/replace. There are more individual sites not addressed by this patch.	2021-10-12 14:03:08 +11:00
WithoutPants	e9d48683f8	Autotag scraper (#1817 ) * Refactor scraper structures * Move matching code into new package * Add autotag scraper * Always check first letter of auto-tag names * Account for nulls Co-authored-by: Kermie <kermie@isinthe.house>	2021-10-11 23:06:06 +11:00
SmallCoccinelle	a5ca8fc678	Enable safe linters (#1786 ) * Enable safe linters Enable the linters dogsled, rowserrcheck, and sqlclosecheck. These report no errors currently in the code base. Enable misspell. Misspell finds two spelling mistakes in comments, which are fixed by the patch as well. Add and sort linters which are relatively safe to add over time. Comment them out for now. * Close the response body If we can get a HTTP response, it has a body which ought to be closed. By doing so, we avoid potentially leaking connections. * Enable the exportloopref linter There are two places in the code with these warnings. Fix them while enabling the linter. * Remove redundant types in tests If a slice already determines the type, the inner type declaration is redundant. Remove the inner declarations. * Mark autotag test cases as parallel Autotag test cases is by far the outlier when it comes to test time. While go test runs test cases in parallel, it doesn't do so inside a given package, unless one marks the test cases as parallel. This change provides a significant speedup on a 8-core machine for test runs.	2021-10-03 11:48:03 +11:00
Eng Zer Jun	62af723017	refactor: move from io/ioutil to io and os package (#1772 ) The io/ioutil package has been deprecated as of Go 1.16, see https://golang.org/doc/go1.16#ioutil. This commit replaces the existing io/ioutil functions with their new definitions in io and os packages. Signed-off-by: Eng Zer Jun <engzerjun@gmail.com>	2021-09-27 10:55:23 +10:00
SmallCoccinelle	a9e2a590b2	Lint checks phase 2 (#1747 ) * Log 3 unchecked errors Rather than ignore errors, log them at the WARNING log level. The server has been functioning without these, so assume they are not at the ERROR level. * Log errors in concurrency test If we can't initialize the configuration, treat the test as a failure. * Undo the errcheck on configurations for now. * Handle unchecked errors in pkg/manager * Resolve unchecked errors * Handle DLNA/DMS unchecked errors * Handle error checking in concurrency test Generalize config initialization, so we can initialize a configuration without writing it to disk. Use this in the test case, since otherwise the test fails to write. * Handle the remaining unchecked errors * Heed gosimple in update test * Use one-line if-initializer statements While here, fix a wrong variable capture error. * testing.T doesn't support %w use %v instead which is supported. * Remove unused query builder functions The Int/String criterion handler functions are now generalized. Thus, there's no need to keep these functions around anymore. * Mark filterBuilder.addRecursiveWith nolint The function is useful in the future and no other refactors are looking nice. Keep the function around, but tell the linter to ignore it. * Remove utils.Btoi There are no users of this utility function * Return error on scan failure If we fail to scan the row when looking for the unique checksum index, then report the error upwards. * Fix comments on exported functions * Fix typos * Fix startup error	2021-09-23 17:15:50 +10:00
gitgiggety	b83ce29ac4	Scraper log improvements (#1741 ) * Fix logs from scraper and plugins not being shown in UI Using `logger.` in the logger package to write logs is "incorrect". This as the package contains a variable named `logger` which contains the logrus instance. So instead of the log line being handled by the custom log implementation / wrapper which makes sure the lines are shown in the UI as well, it's written to logrus directly meaning the wrapper is skipped. This "issue" is obviously triggered because in any other place `logger.X` can be used and it will used the custom logger package / wrapper which works fine. * Add plugin / scraper name to logging output Indicate which plugin / scraper wrote a log message by including its name to the `[Scrape]` prefix. * Add missing addLogItem call	2021-09-19 10:06:34 +10:00
stg-annon	d29699fa30	Support scraper logging to specific log levels (#1648 ) * init scrapper log levels * Refactor plugin logging Co-authored-by: WithoutPants <53250216+WithoutPants@users.noreply.github.com>	2021-09-17 09:09:44 +10:00
WithoutPants	1a3a2f1f83	Scrape scene by name (#1712 ) * Support scrape scene by name in configs * Initial scene querying * Add to manual	2021-09-14 14:54:53 +10:00
gitgiggety	04e5ac9c2f	Studio aliases (#1660 ) * Add migration to create studio aliases table * Refactor studioQueryBuilder.Query to use filterBuilder * Expand GraphQL API with aliases support for studio * Add aliases support for studios to the UI * List aliases in details panel * Allow editing aliases in edit panel * Add 'aliases' filter when searching * Find studios by alias in filter / select * Add auto-tagging based on studio aliases * Support studio aliases for filename parsing * Support importing and exporting of studio aliases * Search for studio alias as well during scraping	2021-09-09 18:13:42 +10:00
SmallCoccinelle	82a41e17c7	Avoid wrapping strings.Replace in Contains (#1710 ) The strings.Replace function counts the number of replacements. If 0, the original string is returned. Hence, there is no need to check if a replacement will happen before doing the work.	2021-09-09 14:10:39 +10:00
SmallCoccinelle	4b00d24248	Remove unused (#1709 ) * Remove stuff which isn't being used Some fields, functions and structs aren't in use by the project. Remove them for janitorial reasons. * Remove more unused code All of these functions are currently not in use. Clean up the code by removal, since the version control has the code if need be. * Remove unused functions There's a large set of unused functions and variables in the code base. Remove these, so it clearer what code to support going forward. Dead code has been eliminated. Where applicable, comment const-sections in tests, so reserved identifiers are still known. * Fix use-def of tsURL The first def of tsURL doesn't matter because there's no use before we hit the 2nd def. * Remove dead code assignment Setting logFile = "" is effectively dead code, because there's no use of it later. * Comment out found The variable 'found' is dead in the function (because no post-process action is following it). Comment it for now. * Comment dead code in tests These might provide hints as to what isn't covered at the moment. * Dead code removal In the case of constants where iota is involved, move the iota so it matches the current key values. This avoids problems with persistently stored key IDs.	2021-09-09 14:10:08 +10:00
SmallCoccinelle	e7f6cb22b7	Error strings noncapitalized (#1704 ) * Fix error string capitalization Error strings often follow another string. Hence, they should not be capitalized, unless referencing a name. * Uncapitalize more error strings While here, use %v on the error directly, which makes it easier to wrap the error later with %w if need be. * Uncapitalize more error strings While here, rename Url to URL as a nitpick.	2021-09-08 11:23:10 +10:00
WithoutPants	4625e1f955	Unify scrape refactor (#1630 ) * Unify scraped types * Make name fields optional * Unify single scrape queries * Change UI to use new interfaces * Add multi scrape interfaces * Use images instead of image	2021-09-07 11:54:22 +10:00
gitgiggety	dfd55346b2	Scrape tag exclusions (#1617 ) * Add config option for scraper tag exclusion patterns Add a config option for exclusing tags / tag patterns from the scraper results. * Handle tag exclusion patterns during scraping	2021-08-10 14:07:01 +10:00
bnkai	4c05535a13	Fix potential race condintion in CDP (#1536 )	2021-06-28 10:36:51 +10:00
peolic	be2fe1de26	Update `chromedp` to fix console errors (#1521 )	2021-06-23 08:05:58 +10:00
WithoutPants	c70faa2a53	Tag aliases (#1412 ) * Add Tag Update/UpdateFull * Tag alias implementation * Refactor tag page * Add aliases in UI * Include tag aliases in q filter * Include aliases in tag select * Add aliases to auto-tagger * Use aliases in scraper * Add tag aliases for filename parser	2021-05-26 14:36:05 +10:00
peolic	cc5ec650ae	Fix scraper date parser failing when parsing time (#1431 ) * Don't mutate the original scraped date `time.Parse` is case-sensitive for some values, `AM/pm` in particular	2021-05-26 07:29:51 +10:00
EnameEtavir	5c4351f338	Cleanup fixes (#1422 ) * cleanup: remove dead code removing some code that does nothing * cleanup: fixing usage of deprecated gqlgen/graphql api in api/changeset_translator * cleanup: changing to recommended comparison methods Changing byte and case-insensitive string comparison to the recommended methods. * cleanup: making staticcheck happy	2021-05-25 11:03:09 +10:00
EnameEtavir	dc453c193d	Fix: file close even if file was not opened (#1417 ) Fixed a bug where in many implementations of load-file functions the file-close was still executed even if the file-open resulted in an error.	2021-05-25 07:52:55 +10:00
gitgiggety	586d146fdb	Apply all post processors to performer (#1387 ) * Apply all post processors to performer Scraping a performer by fragment doesn't correctly work with tags. When tags are returned to the scraper then all are recognized as new. This is due to the post process method not being applied while it should be, as is done when scraping a performer by URL.	2021-05-21 12:32:28 +10:00
bnkai	ab24d0f625	Add subtractDays pp action to scraper (#1399 )	2021-05-21 12:20:12 +10:00
bnkai	bc9aa02835	Discard null values from scraper results (#1374 )	2021-05-16 16:40:54 +10:00
InfiniteTF	896c3874af	Stash-Box Performer Tagger (#1277 ) * Add bulk stash-box performer task * Add stash-box performer scraper to scrape with menu	2021-05-03 14:21:20 +10:00
bnkai	597576f5e6	Get distinct values from scraper (#1338 ) Co-authored-by: WithoutPants <53250216+WithoutPants@users.noreply.github.com>	2021-04-29 11:38:55 +10:00
bnkai	aedadc3857	Add lbToKg pp action to the scraper (#1337 )	2021-04-26 13:31:25 +10:00
julien0221	d673c4ce03	added details, deathdate, hair color, weight to performers and added details to studios (#1274 ) * added details to performers and studios * added deathdate, hair_color and weight to performers * Simplify performer/studio create mutations * Add changelog and recategorised Co-authored-by: WithoutPants <53250216+WithoutPants@users.noreply.github.com>	2021-04-16 16:06:35 +10:00
bnkai	cd6b6b74eb	Add http headers support to scraper (#1273 )	2021-04-16 15:42:56 +10:00
WithoutPants	f6ffda7504	Setup and migration UI refactor (#1190 ) * Make config instance-based * Remove config dependency in paths * Refactor config init * Allow startup without database * Get system status at UI initialise * Add setup wizard * Cache and Metadata optional. Database mandatory * Handle metadata not set during full import/export * Add links * Remove config check middleware * Stash not mandatory * Panic on missing mandatory config fields * Redirect setup to main page if setup not required * Add migration UI * Remove unused stuff * Move UI initialisation into App * Don't create metadata paths on RefreshConfig * Add folder selector for generated in setup * Env variable to set and create config file. Make docker images use a fixed config file. * Set config file during setup	2021-04-12 09:31:33 +10:00
InfiniteTF	c38660d209	Add phash generation and dupe checking (#1158 )	2021-04-12 09:04:40 +10:00
bnkai	2edcdeaeb9	Support today, yesterday when using parseDate in scrapers (#1261 )	2021-04-07 09:09:04 +10:00
bnkai	4299f113e0	Fix Freeones search (#1230 )	2021-03-25 10:01:56 +11:00
bnkai	68d4a4fe42	Add User Agent to image download reqs (#1222 )	2021-03-24 08:12:11 +11:00
WithoutPants	a0676d5c30	Performer tags (#1132 ) * Add scraping support for performer tags * Add performer count to tag cards * Refactor sqlite test setup * Add performer tag filtering in gallery and image * Add bulk update performer * Add Performers tab to tag page * Add count filters and sort bys for tags * Move scene count to icon in performer card #1148	2021-03-10 12:25:51 +11:00
SpedNSFW	bde5d07afb	Find correct python executable (#1156 ) * find correct python executable For script scrapers using python, both python and python3 are valid depending on the OS and running environment. To save users from having any issues, this change will find the correct executable for them. Co-authored-by: bnkai <bnkai@users.noreply.github.com>	2021-03-03 08:01:01 +11:00
bnkai	117e6326db	Expose url for URLReplace in JSON scrapeByURL and scrapeByFragment (#1150 ) * Expose url for URLReplace in JSON scrapeByURL and scrapeByFragment * Apply queryURLReplace to xpath scrapers Co-authored-by: WithoutPants <53250216+WithoutPants@users.noreply.github.com>	2021-03-02 09:19:56 +11:00
bnkai	144cd6e4f2	Skip insecure certificates check when scraping (#1120 ) * Ignore insecure certificates when scraping * add ScraperCertCheck to scraper config options	2021-03-01 11:47:39 +11:00
SpedNSFW	acbdee76de	Random strings for cookie values (#1122 )	2021-02-23 13:40:43 +11:00
bnkai	984a0c9247	Tweak scraper script error printing (#1107 )	2021-02-09 19:07:53 +11:00
SpedNSFW	714ae541d4	fix json unmarshal error return (#1109 )	2021-02-09 19:04:42 +11:00
InfiniteTF	4fd022a93b	Decouple galleries from scenes (#1057 )	2021-02-02 07:56:54 +11:00
Belley	86bfb64a0d	Fix freeones scraper (#1091 )	2021-02-01 08:15:50 +11:00
WithoutPants	1e04deb3d4	Data layer restructuring (#997 ) * Move query builders to sqlite package * Add transaction system * Wrap model resolvers in transaction * Add error return value for StringSliceToIntSlice * Update/refactor mutation resolvers * Convert query builders * Remove unused join types * Add stash id unit tests * Use WAL journal mode	2021-01-18 12:23:20 +11:00
bnkai	e883e5fe27	Add Mouse Click support for the CDP scraper (#827 )	2020-12-22 09:42:31 +11:00
bnkai	a96ab9ce6f	Add support for setting cookies in the scraper (#934 )	2020-12-01 16:34:09 +11:00
bnkai	aecbd236bc	Tune image referrer path (#968 )	2020-11-30 10:50:43 +11:00
bnkai	e62e74bff4	Use symwalk for scrapers (#938 )	2020-11-16 09:21:26 +11:00
Belley	94392c7c4d	Fixing image for Freeones Scrapers (#930 )	2020-11-07 09:36:26 +11:00
InfiniteTF	9ec762ae9a	Fix outstanding tagger issues (#912 ) * Fix potential image errors * Fix issue preventing favoriting of tagged performers * Add error handling in case of network issues * Show individual search errors * Unset scene results if query fails * Don't abort scene submission if scene id isn't found	2020-11-05 08:28:58 +11:00
InfiniteTF	3346f8dcca	Stash-box tagger integration (#454 )	2020-10-24 14:31:39 +11:00
WithoutPants	70f73ecf4a	Update freeones scraper (#881 )	2020-10-24 13:12:21 +11:00
WithoutPants	109e55a25a	Query url parameters (#878 )	2020-10-22 11:56:04 +11:00
SpedNSFW	147d0067f5	Add gallery scraping (#862 )	2020-10-21 09:24:32 +11:00
WithoutPants	7a45943e8e	Stash box client interface (#751 ) * Add gql client generation files * Update dependencies * Add stash-box client generation to the makefile * Move scraped scene object matchers to models * Add stash-box to scrape with dropdown * Add scrape scene from fingerprint in UI	2020-09-17 19:57:18 +10:00
WithoutPants	9a84726128	Fix xpath comment element parsing (#759 )	2020-08-23 17:39:15 +10:00
woodgen	e3ea3ea85e	scraper/mapped: Add feetToCm post process. (#711 ) This patch adds a feetToCm post process that converts imperial feet and inches to centimeters.	2020-08-12 11:17:43 +10:00
woodgen	4045ddf3e9	Implement scraping movies by URL (#709 ) * api/urlbuilders/movie: Auto format. * graphql+pkg+ui: Implement scraping movies by URL. This patch implements the missing required boilerplate for scraping movies by URL, using performers and scenes as a reference. Although this patch contains a big chunck of ground work for enabling scraping movies by fragment, the feature would require additional changes to be completely implemented and was not tested. * graphql+pkg+ui: Scrape movie studio. Extends and corrects the movie model for the ability to store and dereference studio IDs with received studio string from the scraper. This was done with Scenes as a reference. For simplicity the duplication of having `ScrapedMovieStudio` and `ScrapedSceneStudio` was kept, which should probably be refactored to be the same type in the model in the future. * ui/movies: Add movie scrape dialog. Adds possibility to update existing movie entries with the URL scraper. For this the MovieScrapeDialog.tsx was implemented with Performers and Scenes as a reference. In addition DurationUtils needs to be called one time for converting seconds from the model to the string that is displayed in the component. This seemed the least intrusive to me as it kept a ScrapeResult<string> type compatible with ScrapedInputGroupRow.	2020-08-10 15:34:15 +10:00
WithoutPants	7158e83b75	Add JSON scrape support (#717 ) * Add support for scene fragment scrape in xpath	2020-08-10 14:21:50 +10:00
WithoutPants	5992ff8706	Add oshash support (#667 )	2020-08-06 11:21:14 +10:00
WithoutPants	b166abfa7b	Fix scraping error (#704 )	2020-08-04 20:43:56 +10:00
bnkai	4373f9bf01	Add cdp support for xpath scrapers (#625 ) Co-authored-by: WithoutPants <53250216+WithoutPants@users.noreply.github.com>	2020-08-04 10:42:40 +10:00
WithoutPants	2b9215702e	Refactor xpath scraper code. Add fixed and map (#616 ) * Refactor xpath scraper code * Make post-process a list * Add map post-process action * Add fixed xpath values * Refactor scrapers into cache * Refactor into mapped config * Trim test html	2020-07-21 14:06:25 +10:00
bnkai	56210cf456	Use referer on xpath getImage, apply printHTML to subscraper also (#661 )	2020-07-10 08:42:06 +10:00
bnkai	f8048dc27c	Increase xpath redirects, use cookies (#624 )	2020-06-22 12:18:02 +10:00
bnkai	9d0522f62d	Add "split" xpath in post-processing , newlines in replace support (#579 )	2020-06-18 10:47:10 +10:00
bnkai	a7ac02fb50	freeones fixes (#615 )	2020-06-17 11:02:06 +10:00
bnkai	f40e234748	Apply xpath parseDate after subScraper (#606 )	2020-06-15 21:38:59 +10:00
WithoutPants	d8ce137086	Reload scrapers button (#592 ) * Add reload scraper option to performer details * Add scraper reload to scene edit page * Show scene scraper menu when no queryable scrapers * Add 0.3 changelog	2020-06-10 13:43:17 +10:00
bnkai	b89956de25	freeones scraper fixes/tweaking (#584 )	2020-06-02 09:45:37 +10:00
bnkai	ccd75731b7	Change scrape matching (studio, movies, tag, performers) to case insensitive (#556 ) * Change scrape matching (studio, movies, tag, performers) to case insensitive * * fix collate order * * make filename parser findbyname calls case insensitive * * add unit testing for Tags GetFindbyName/s	2020-05-24 16:19:22 +10:00
WithoutPants	ec420df871	Add debug logging for xpath scraping (#555 ) * Add debug logging for xpath scraping * Add logging for processing scene members	2020-05-20 22:46:00 +10:00
WithoutPants	05488d59c3	Find scrapers in subdirectories (#554 )	2020-05-19 08:44:33 +10:00
bnkai	0fc57ce1e0	Fix xpath comments text (#550 )	2020-05-18 12:26:20 +10:00
WithoutPants	215c4e3bde	Change builtin freeones scraper to community yml (#542 )	2020-05-15 20:10:20 +10:00
bnkai	0b50e83dbf	freeones scraper tweaks (#509 )	2020-05-04 14:11:49 +10:00
WithoutPants	3d22d5a742	Refactor build (#493 ) * Add lint/format checks to build * Make travis get full repo to get tags * Run packr2 once in cross-compile * Fix quotes in package.json * Fix linting issues * Formatting * Fix vet issue * Fix go lint issues * Show start of each platform compilation * Add validate target * Set gitattributes for go fmt and mod vendor * Fix tag name * Add fmt-ui target	2020-04-29 12:13:08 +10:00
WithoutPants	82201e23e0	Make ethnicity freetext and fix freeones ethnicity panic (#431 ) * Make ethnicity free text * Fix panic in freeones scraper for other ethnicity	2020-04-02 08:25:39 +11:00
WithoutPants	abf2b49803	Configurable scraper user agent string (#409 ) * Add debug scrape option. Co-authored-by: HiddenPants255 <>	2020-03-21 08:55:15 +11:00
WithoutPants	34d829338d	Add image scraping support (#370 ) * Add sub-scraper functionality * Add scraping of performer image * Add scene cover image scraping * Port UI changes to v2.5 * Fix v2.5 dialog suggest color * Don't convert eol of UI to support pretty	2020-03-11 11:41:55 +11:00
caustico	5fb8bbf768	Movies Section (#338 ) Co-authored-by: WithoutPants <53250216+WithoutPants@users.noreply.github.com>	2020-03-10 14:28:15 +11:00
WithoutPants	03c07a429d	Add Xpath post processing and performer name query (#333 ) * Extend xpath configuration. Support concatenation * Add parseDate parsing option * Add regex replacements * Add xpath query performer by name * Fix loading spinner on scrape performer * Change ReplaceAll to Replace	2020-01-31 17:17:40 -05:00
WithoutPants	78eb527ec4	Scraper fixes (#332 ) * Fix panic on invalid xpath * Add missing attrs to scraped performer fragment	2020-01-24 22:36:24 -05:00
WithoutPants	7fdaccf669	Xpath scraping from URL (#285 ) * Add xpath performer and scene scraping * Add studio scraping * Refactor code * Fix compile error * Don't overwrite performer URL during a scrape	2020-01-04 11:39:33 -05:00
WithoutPants	f52db4f58b	Add stash scraper type (#269 ) * Add stash scraper type * Add graphql client to vendor * Embed stash credentials in URL * Fill URL from scraped scene * Nil IDs returned from remote stash * Nil check	2019-12-20 19:13:23 -05:00
WithoutPants	92837fe1f7	Add scene metadata scraping functionality (#236 ) * Add scene scraping functionality * Adapt to changed scraper config	2019-12-15 20:35:34 -05:00
WithoutPants	50784025f2	Change scraper config to yaml (#256 )	2019-12-12 14:27:44 -05:00
WithoutPants	17247060b6	Generic performer scrapers (#203 ) * Generalise scraper API * Add script performer scraper * Fixes from testing * Add context to scrapers and generalise * Add scraping performer from URL * Add error handling * Move log to debug * Add supported scrape types	2019-11-18 21:49:05 -05:00
bill	9f6888a3d6	fix freeones scraper bugs	2019-10-16 02:05:49 +03:00
Friendly C	7c94262020	Freeones Scrape: Fix scraping by alias	2019-10-10 23:56:06 +02:00
bnkai	bcc70af7e5	Fix minor freeones scraper bug (#41 ) Fix minor freeones scraper bug	2019-04-11 11:54:38 -07:00
Stash Dev	b488c1ed7d	Reorg	2019-02-14 15:42:52 -08:00

1 2 3 4 5

215 commits