How many sex assault crimes go unreported?

This project was inspired by Seth Stephens-Davidowitz's book EVERYBODY LIES. Seth points out that while we might lie to ourselves, our significant others, our spouses, and our children, the one place we never lie is the Google search box. We only type in there what is honestly on our minds. With that in mind, we will go through aggregated Google search queries believing that people who have been sexually assaulted, while they may never tell anyone, are telling Google.

We will use data from Google Trends to try to get an idea of how prevalent sex crimes are. In Part 1 we gather the daily and hourly search data for various terms that we think are related to RAPE. In Part 2 we will compare our data with government sex-crime data. If there are discrepancies, perhaps they will give us an idea of the underreporting.

I enter the term RAPE into Trends and set it for the past 1 hour, and here is the graph. On this Sunday morning at 11:27am, here is the search activity for the term RAPE in the past 60 minutes. If any part of our starting assumption is correct, the specificity is upsetting.

![]()

```python
pytrends.build_payload(term, cat=0, timeframe=' ', geo='US', gprop='')
```

A blank `gprop` is regular search, but you can segment it by 'youtube', 'images', 'news', or even 'froogle' for shopping. When comparing multiple trends together on Google, the 0-100 values are relative to each other. pytrends is similar in that it can get trends for a list of up to 5 keywords. We, however, are more interested in the trend of each individual term here, so this next function loops through our list of terms from our Excel file earlier, runs each through our scrape_google function above, and then appends this trend data to a pandas dataframe called trends.

The speed of this API is about average: it probably takes 60 seconds to scrape the trends for 100 terms. That is slower than some APIs, but much, much faster than others, like the YouTube API, which is hauntingly slow. Like several-hours-to-scrape-one-channel slow… I digress. At the end of the code block we call the function we just coded up and voilà! We have a dataframe of search trends.

![]()

```python
# get the list of volumes from the terms that had trends
# (required because if a term doesn't have a trend it is removed)
trends.insert(0, "avg monthly volume", volumes_col)  # add the search volume column to the trends dataframe
```

This next block is my favorite in this entire program because it gives us the actual search volumes for each week of the time period, just like the glory days of Google Trends. Divide average monthly search volume by 4 to get average weekly search volume. Divide average weekly search volume by 50 to get a multiplier for that particular term. Calculate the real weekly search volume for each week of the period by multiplying the 0-100 index for that week by the multiplier for that term. (Notice how a 50 on the 0-100 scale is actually just the average weekly search volume.)

```python
writer = pd.  # exports the three dataframes into excel and formats them
```
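The per-term scraping loop described above can be sketched with the pytrends library. The `scrape_google` name and the trends dataframe come from the post; the five-year timeframe, the lazy import, and the empty-result handling are my assumptions, not the post's exact code.

```python
import pandas as pd

def scrape_google(term, timeframe='today 5-y'):
    """Fetch the 0-100 interest-over-time series for one term via pytrends."""
    # pytrends is imported lazily so the sketch can be defined without it
    # installed; `pip install pytrends` is needed to actually run it.
    from pytrends.request import TrendReq
    pytrends = TrendReq(hl='en-US', tz=360)
    # One term per payload, so each series is scaled on its own 0-100 axis
    # rather than relative to the other terms in a batch of up to five.
    pytrends.build_payload([term], cat=0, timeframe=timeframe, geo='US', gprop='')
    df = pytrends.interest_over_time()
    if df.empty:
        return None  # no trend data for this term; the caller drops it
    return df[term]

def build_trends(terms):
    """Loop over the term list, appending each term's trend as a column."""
    trends = pd.DataFrame()
    for term in terms:
        series = scrape_google(term)
        if series is not None:  # terms without a trend are removed
            trends[term] = series
    return trends
```

Scraping one term at a time is the design point here: batching five keywords into one payload would rescale them relative to each other, while separate requests keep each term's 0-100 index independent.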
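The three arithmetic steps for recovering real weekly volumes can be written out directly. The 4000-searches-per-month figure and the list of weekly indices are invented example numbers, not data from the project.

```python
# Reconstructing real weekly search volumes from the 0-100 Trends index.
# avg_monthly_volume would come from a keyword-volume source; 4000 is an
# invented example figure.
avg_monthly_volume = 4000

# Step 1: average weekly volume = average monthly volume / 4.
avg_weekly_volume = avg_monthly_volume / 4   # 1000.0

# Step 2: a 50 on the 0-100 scale IS the average weekly volume, so the
# per-term multiplier is average weekly volume / 50.
multiplier = avg_weekly_volume / 50          # 20.0

# Step 3: real weekly volume = that week's 0-100 index * multiplier.
weekly_indices = [50, 75, 100, 40]           # example index values
real_weekly_volumes = [i * multiplier for i in weekly_indices]
# An index of 50 maps back to exactly the average weekly volume.
```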
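The truncated `writer = pd.` line presumably sets up a pandas ExcelWriter for the export step. A minimal sketch of writing multiple dataframes to one workbook follows; the filename, sheet names, and example data are assumptions, and the post's extra formatting steps are not reproduced.

```python
import pandas as pd

# Hypothetical example dataframes standing in for the project's three.
trends = pd.DataFrame({"term_a": [50, 75], "term_b": [30, 60]})
volumes = pd.DataFrame({"term": ["term_a", "term_b"],
                        "avg_monthly": [4000, 2400]})

# One ExcelWriter, one sheet per dataframe; the context manager saves
# and closes the workbook on exit. Requires openpyxl.
with pd.ExcelWriter("trends_report.xlsx") as writer:
    trends.to_excel(writer, sheet_name="trends")
    volumes.to_excel(writer, sheet_name="volumes", index=False)
```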