mobilizon.chapril.org-mobil.../lib/service/formatter/html.ex

# Portions of this file are derived from Pleroma:
# Copyright © 2017-2019 Pleroma Authors <https://pleroma.social>
# SPDX-License-Identifier: AGPL-3.0-only
# Upstream: https://git.pleroma.social/pleroma/pleroma/blob/develop/lib/pleroma/html.ex

defmodule Mobilizon.Service.Formatter.HTML do
  @moduledoc """
  Service to filter tags out of HTML content.
  """

  alias FastSanitize.Sanitizer

  alias Mobilizon.Service.Formatter.{DefaultScrubbler, OEmbed}

  def filter_tags(html), do: Sanitizer.scrub(html, DefaultScrubbler)

  @spec strip_tags(String.t()) :: String.t() | no_return()
  def strip_tags(html) do
    case FastSanitize.strip_tags(html) do
      {:ok, html} ->
        HtmlEntities.decode(html)

      _ ->
        raise "Failed to filter tags"
    end
  end

  @doc """
  Inserts a space before tags closing so that words are not attached once tags stripped

  `<h1>test</h1>next` thing becomes `test next` instead of `testnext`
  """
  @spec strip_tags_and_insert_spaces(String.t()) :: String.t()
  def strip_tags_and_insert_spaces(html) when is_binary(html) do
    html
    |> String.replace("><", "> <")
    |> strip_tags()
  end

  def strip_tags_and_insert_spaces(html), do: html

  def filter_tags_for_oembed(html), do: Sanitizer.scrub(html, OEmbed)
end
Split Federation as separate context 2020-01-22 02:14:42 +01:00			`# Portions of this file are derived from Pleroma:`
			`# Copyright © 2017-2019 Pleroma Authors <https://pleroma.social>`
			`# SPDX-License-Identifier: AGPL-3.0-only`
			`# Upstream: https://git.pleroma.social/pleroma/pleroma/blob/develop/lib/pleroma/html.ex`

			`defmodule Mobilizon.Service.Formatter.HTML do`
			`@moduledoc """`
			`Service to filter tags out of HTML content.`
			`"""`

Introduce group basic federation, event new page and notifications Signed-off-by: Thomas Citharel <tcit@tcit.fr> 2020-02-18 08:57:00 +01:00			`alias FastSanitize.Sanitizer`
Split Federation as separate context 2020-01-22 02:14:42 +01:00
Introduce group basic federation, event new page and notifications Signed-off-by: Thomas Citharel <tcit@tcit.fr> 2020-02-18 08:57:00 +01:00			`alias Mobilizon.Service.Formatter.{DefaultScrubbler, OEmbed}`
Split Federation as separate context 2020-01-22 02:14:42 +01:00
Introduce group basic federation, event new page and notifications Signed-off-by: Thomas Citharel <tcit@tcit.fr> 2020-02-18 08:57:00 +01:00			`def filter_tags(html), do: Sanitizer.scrub(html, DefaultScrubbler)`

Various refactoring and typespec improvements Signed-off-by: Thomas Citharel <tcit@tcit.fr> 2021-09-24 16:46:42 +02:00			`@spec strip_tags(String.t()) :: String.t() \| no_return()`
Drop HTMLSanitizeEx and fix title sanitizing Signed-off-by: Thomas Citharel <tcit@tcit.fr> 2020-06-24 16:33:59 +02:00			`def strip_tags(html) do`
			`case FastSanitize.strip_tags(html) do`
			`{:ok, html} ->`
Decode HTML entities when sanitized Signed-off-by: Thomas Citharel <tcit@tcit.fr> 2021-03-29 19:26:49 +02:00			`HtmlEntities.decode(html)`
Drop HTMLSanitizeEx and fix title sanitizing Signed-off-by: Thomas Citharel <tcit@tcit.fr> 2020-06-24 16:33:59 +02:00
			`_ ->`
			`raise "Failed to filter tags"`
			`end`
			`end`

Insert spaces before stripping HTML when inserting search data Signed-off-by: Thomas Citharel <tcit@tcit.fr> 2020-07-31 11:19:42 +02:00			`@doc """`
			`Inserts a space before tags closing so that words are not attached once tags stripped`

			`<h1>test</h1>next` thing becomes `test next` instead of `testnext`
			`"""`
			`@spec strip_tags_and_insert_spaces(String.t()) :: String.t()`
Allow to refresh instance outbox when they accept subscription Signed-off-by: Thomas Citharel <tcit@tcit.fr> 2020-09-02 08:59:59 +02:00			`def strip_tags_and_insert_spaces(html) when is_binary(html) do`
Insert spaces before stripping HTML when inserting search data Signed-off-by: Thomas Citharel <tcit@tcit.fr> 2020-07-31 11:19:42 +02:00			`html`
[Metadata] Fix actors not sanitizing their description and refactor Signed-off-by: Thomas Citharel <tcit@tcit.fr> 2020-11-17 15:45:08 +01:00			`\|> String.replace("><", "> <")`
Insert spaces before stripping HTML when inserting search data Signed-off-by: Thomas Citharel <tcit@tcit.fr> 2020-07-31 11:19:42 +02:00			`\|> strip_tags()`
			`end`

Allow to refresh instance outbox when they accept subscription Signed-off-by: Thomas Citharel <tcit@tcit.fr> 2020-09-02 08:59:59 +02:00			`def strip_tags_and_insert_spaces(html), do: html`

Introduce group basic federation, event new page and notifications Signed-off-by: Thomas Citharel <tcit@tcit.fr> 2020-02-18 08:57:00 +01:00			`def filter_tags_for_oembed(html), do: Sanitizer.scrub(html, OEmbed)`
Split Federation as separate context 2020-01-22 02:14:42 +01:00			`end`