regex - Google Sheets Formula for Extracting Domain From Website?

Question

asked Oct 17, 2021 in Technique by 深蓝 (71.8m points)

=REGEXEXTRACT(A1:A,"(?m)http(?:s?)://.*?([^./]+?\.[^.]+?)(?:/|$)")

I'm trying to extract the domain from a website URL.

The formula above works when the link looks like this: https://walmart.com/careers

However, it doesn't work if the value is already a bare domain (walmart.com) or has no scheme, as in www.walmart.com/careers

Is there a more thorough formula that handles these edge cases?

1 Answer

深蓝 · Answer 1 · 2021-10-17T03:08:55+0000

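Try this, which strips the scheme and a leading www. and then keeps everything before the first slash:

=ARRAYFORMULA(INDEX(SPLIT(REGEXREPLACE(A8:A12, "https?://www\.|https?://|www\.", ), "/"),,1))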

UPDATE 1:

=ARRAYFORMULA(IFNA(REGEXEXTRACT(INDEX(SPLIT(
 REGEXREPLACE(A8:A14, "https?://www\.|https?://|www\.", ), "/"),,1),
 "\.(.+\..+)"), INDEX(SPLIT(
 REGEXREPLACE(A8:A14, "https?://www\.|https?://|www\.", ), "/"),,1)))
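
As a rough cross-check outside Sheets, the UPDATE 1 logic can be sketched in Python. This assumes the dots in the patterns are meant as literal dots (written `\.`). The function name `strip_first_label` is made up for illustration, and Python's `re` is not the RE2 engine that Sheets uses, although these patterns only use syntax common to both.

```python
import re

# Assumed patterns: same as the UPDATE 1 formula, with dots escaped.
PREFIX = re.compile(r"https?://www\.|https?://|www\.")
SUBDOMAIN = re.compile(r"\.(.+\..+)")


def strip_first_label(url: str) -> str:
    """Mimic UPDATE 1: drop scheme/www, keep the host, then drop one subdomain label."""
    # REGEXREPLACE(..., "") followed by INDEX(SPLIT(..., "/"),,1)
    host = PREFIX.sub("", url).split("/")[0]
    # REGEXEXTRACT(...) wrapped in IFNA(..., host)
    match = SUBDOMAIN.search(host)
    return match.group(1) if match else host


if __name__ == "__main__":
    for url in ("https://careers.walmart.com/jobs", "walmart.com", "walmart.co.uk"):
        print(url, "->", strip_first_label(url))
```

Note that the subdomain step only looks at the number of dots, so a host such as walmart.co.uk is reduced to co.uk. If multi-part suffixes matter, UPDATE 2 (which keeps the full host) may be the safer choice.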

UPDATE 2:

=INDEX(IFERROR(REGEXEXTRACT(A1:A, "^(?:https?://)?(?:ftp://)?(?:www\.)?([^/]+)")))
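
To see what the UPDATE 2 pattern captures for the three inputs from the question, here is a minimal Python sketch. It assumes the dot after www is escaped, and `extract_host` is a hypothetical helper name. Python's `re` stands in for the Sheets regex engine here, so treat it as an approximation rather than an exact replica.

```python
import re

# Assumed pattern: the UPDATE 2 regex with the dot after "www" escaped.
HOST_PATTERN = re.compile(r"^(?:https?://)?(?:ftp://)?(?:www\.)?([^/]+)")


def extract_host(url: str) -> str:
    """Return the host without scheme or leading 'www.'; '' plays the role of IFERROR's blank."""
    match = HOST_PATTERN.match(url)
    return match.group(1) if match else ""


if __name__ == "__main__":
    for url in ("https://walmart.com/careers", "walmart.com", "www.walmart.com/careers"):
        print(url, "->", extract_host(url))
```

All three inputs reduce to walmart.com. Other subdomains are left in place, so careers.walmart.com stays as careers.walmart.com.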
