Commit graph

31 commits

Author SHA1 Message Date
xfnw
34b11fa5af fix mdash not being htmldecoded 2021-12-11 12:50:41 -05:00
xfnw
9acb68d457 automatic focus of search bar 2021-10-27 19:03:53 -04:00
f10cd8d3fe rename LICENSE 2021-10-22 16:12:28 -04:00
xfnw
e9b6f113d9 limit filesize of crawls 2021-10-22 11:43:33 -04:00
xfnw
f23fdaad28 use sqlite's FTS5 as the ranking algorithm 2021-10-20 16:41:36 -04:00
xfnw
4643aeec70 increase crawling timeout 2021-10-20 15:19:56 -04:00
xfnw
34e9d6b8ca add wildcards to gitignore 2021-10-20 15:18:29 -04:00
xfnw
cbb1173184 oops remove debug print 2021-06-24 14:08:08 -04:00
xfnw
9dcc667bc1 ignore script and style tags from content, and make newlines into spaces so words are not combined 2021-06-24 14:02:37 -04:00
xfnw
9a2ff10616 fix the double-escaping of htmlspecialchars 2021-06-24 14:00:33 -04:00
xfnw
826e3c2b7c delete page before downloading new one, so dead pages do not sit in the database 2021-01-27 20:33:25 -05:00
xfnw
c6b476c4a3 crawl slower so sites wont ban it lol 2021-01-16 13:39:46 -05:00
xfnw
3e051c0feb better crawl.php logging 2021-01-08 17:35:14 -05:00
xfnw
459c295488 better crawl.php logging 2021-01-08 16:54:16 -05:00
xfnw
3fdad57505 Merge branch 'main' of github.com:xfnw/searpl into main 2020-12-31 21:07:10 -05:00
xfnw
d4d5e67da8 uniquely sort urls and put the db the gitignore 2020-12-31 21:05:50 -05:00
8098d8d1af no more cloudflare 2020-12-29 16:40:36 -05:00
xfnw
036e3addb2 re-crawl sites 2020-12-22 10:13:45 -05:00
xfnw
8fa66770b1 dont use a cdn for one little search icon 2020-12-16 21:42:08 -05:00
xfnw
2fb507356e some index stats and instructions 2020-12-16 11:36:46 -05:00
xfnw
96a12ecd4b change scoring system to discourage long feeds of keywords 2020-12-16 10:58:08 -05:00
xfnw
6a1b3d66d8 make sorting case-insensitive 2020-12-15 21:02:31 -05:00
xfnw
be5b250319 go through down sites faster 2020-12-15 12:33:00 -05:00
6b7002417d fix the checkboxes in the readme 2020-12-15 11:02:24 -05:00
xfnw
5eb94a6834 some instructions 2020-12-15 10:59:23 -05:00
xfnw
2b46e8899f set a useragent 2020-12-15 10:17:28 -05:00
xfnw
8c4421108b dont track svgs and drop the / from the end of urls so they wont be duplicated 2020-12-15 09:59:19 -05:00
xfnw
75be84f895 fix tagging, oops lol 2020-12-14 22:07:22 -05:00
xfnw
79ba217da2 sort them 2020-12-14 17:50:06 -05:00
xfnw
c62b48ae1f unique urls 2020-12-14 17:22:28 -05:00
xfnw
69de9f49dc working 2020-12-14 16:59:16 -05:00