Home
/
Blog
/
Tech Assessment
/
Data Visualization for Beginners-Part 3

Data Visualization for Beginners-Part 3

Author
Shubham Gupta
Calendar Icon
July 9, 2018
Timer Icon
3 min read
Share

Explore this post with:

Bonjour! Welcome to another part of the series on data visualization techniques. In the previous two articles, we discussed different data visualization techniques that can be applied to visualize and gather insights from categorical and continuous variables. You can check out the first two articles here:

In this article, we’ll go through the implementation and use of a bunch of data visualization techniques such as heat maps, surface plots, correlation plots, etc. We will also look at different techniques that can be used to visualize unstructured data such as images, text, etc.

 ### Importing the required libraries   
 import pandas as pd   
 import numpy as np  
 import seaborn as sns   
 import matplotlib.pyplot as plt   
 import plotly.plotly as py  
 import plotly.graph_objs as go  
 %matplotlib inline  

Heatmaps

A heat map(or heatmap) is a two-dimensional graphical representation of the data which uses colour to represent data points on the graph. It is useful in understanding underlying relationships between data values that would be much harder to understand if presented numerically in a table/ matrix.

### We can create a heatmap by simply using the seaborn library.   
 sample_data = np.random.rand(8, 12)  
 ax = sns.heatmap(sample_data)  
Heatmaps, seaborn, python, matplot, data visualization
Fig 1. Heatmap using the seaborn library

Let’s understand this using an example. We’ll be using the metadata from Deep Learning 3 challenge. Link to the dataset. Deep Learning 3 challenged the participants to predict the attributes of animals by looking at their images.

 ### Training metadata contains the name of the image and the corresponding attributes associated with the animal in the image.  
 train = pd.read_csv('meta-data/train.csv')  
 train.head()  

We will be analyzing how often an attribute occurs in relationship with the other attributes. To analyze this relationship, we will compute the co-occurrence matrix.

 ### Extracting the attributes  
 cols = list(train.columns)  
 cols.remove('Image_name')  
 attributes = np.array(train[cols])  
 print('There are {} attributes associated with {} images.'.format(attributes.shape[1],attributes.shape[0]))  
 Out: There are 85 attributes associated with 12,600 images.  
 # Compute the co-occurrence matrix  
 cooccurrence_matrix = np.dot(attributes.transpose(), attributes)  
 print('\n Co-occurrence matrix: \n', cooccurrence_matrix)  
 Out: Co-occurrence matrix:   
  [[5091 728 797 ... 3797 728 2024]  
  [ 728 1614  0 ... 669 1614 1003]  
  [ 797  0 1188 ... 1188  0 359]  
  ...  
  [3797 669 1188 ... 8305 743 3629]  
  [ 728 1614  0 ... 743 1933 1322]  
  [2024 1003 359 ... 3629 1322 6227]]  
 # Normalizing the co-occurrence matrix, by converting the values into a matrix  
 # Compute the co-occurrence matrix in percentage  
 #Reference:https://stackoverflow.com/questions/20574257/constructing-a-co-occurrence-matrix-in-python-pandas/20574460  
 cooccurrence_matrix_diagonal = np.diagonal(cooccurrence_matrix)  
 with np.errstate(divide = 'ignore', invalid='ignore'):  
   cooccurrence_matrix_percentage = np.nan_to_num(np.true_divide(cooccurrence_matrix, cooccurrence_matrix_diagonal))  
 print('\n Co-occurrence matrix percentage: \n', cooccurrence_matrix_percentage)  

We can see that the values in the co-occurrence matrix represent the occurrence of each attribute with the other attributes. Although the matrix contains all the information, it is visually hard to interpret and infer from the matrix. To counter this problem, we will use heat maps, which can help relate the co-occurrences graphically.

 fig = plt.figure(figsize=(10, 10))  
 sns.set(style='white')  
 # Draw the heatmap with the mask and correct aspect ratio   
 ax = sns.heatmap(cooccurrence_matrix_percentage, cmap='viridis', center=0, square=True, linewidths=0.15, cbar_kws={"shrink": 0.5, "label": "Co-occurrence frequency"}, )  
 ax.set_title('Heatmap of the attributes')  
 ax.set_xlabel('Attributes')  
 ax.set_ylabel('Attributes')  
 plt.show()  
Heatmap, data visualization, python, co occurence, seaborn
Fig 2. Heatmap of the co-occurrence matrix indicating the frequency of occurrence of one attribute with other

Since the frequency of the co-occurrence is represented by a colour pallet, we can now easily interpret which attributes appear together the most. Thus, we can infer that these attributes are common to most of the animals.

Machine learning challenge, ML challenge

Choropleth

Choropleths are a type of map that provides an easy way to show how some quantity varies across a geographical area or show the level of variability within a region. A heat map is similar but doesn’t include geographical boundaries. Choropleth maps are also appropriate for indicating differences in the distribution of the data over an area, like ownership or use of land or type of forest cover, density information, etc. We will be using the geopandas library to implement the choropleth graph.

We will be using choropleth graph to visualize the GDP across the globe. Link to the dataset.

 # Importing the required libraries  
 import geopandas as gpd   
 from shapely.geometry import Point  
 from matplotlib import cm  
 # GDP mapped to the corresponding country and their acronyms  
 df =pd.read_csv('GDP.csv')  
 df.head()  
COUNTRY GDP (BILLIONS) CODE
0 Afghanistan 21.71 AFG
1 Albania 13.40 ALB
2 Algeria 227.80 DZA
3 American Samoa 0.75 ASM
4 Andorra 4.80 AND
### Importing the geometry locations of each country on the world map  
 geo = gpd.read_file(gpd.datasets.get_path('naturalearth_lowres'))[['iso_a3', 'geometry']]  
 geo.columns = ['CODE', 'Geometry']  
 geo.head()  
# Mapping the country codes to the geometry locations  
 df = pd.merge(df, geo, left_on='CODE', right_on='CODE', how='inner')  
 #converting the dataframe to geo-dataframe  
 geometry = df['Geometry']  
 df.drop(['Geometry'], axis=1, inplace=True)  
 crs = {'init':'epsg:4326'}  
 geo_gdp = gpd.GeoDataFrame(df, crs=crs, geometry=geometry)  
 ## Plotting the choropleth  
 cpleth = geo_gdp.plot(column='GDP (BILLIONS)', cmap=cm.Spectral_r, legend=True, figsize=(8,8))  
 cpleth.set_title('Choropleth Graph - GDP of different countries')  
choropleth maps, choropleth graphs, data visualization techniques, python, big data, machine learning
Fig 3. Choropleth graph indicating the GDP according to geographical locations

Surface plot

Surface plots are used for the three-dimensional representation of the data. Rather than showing individual data points, surface plots show a functional relationship between a dependent variable (Z) and two independent variables (X and Y).

It is useful in analyzing relationships between the dependent and the independent variables and thus helps in establishing desirable responses and operating conditions.

 from mpl_toolkits.mplot3d import Axes3D  
 from matplotlib.ticker import LinearLocator, FormatStrFormatter  
 # Creating a figure  
 # projection = '3d' enables the third dimension during plot  
 fig = plt.figure(figsize=(10,8))  
 ax = fig.gca(projection='3d')  
 # Initialize data   
 X = np.arange(-5,5,0.25)  
 Y = np.arange(-5,5,0.25)  
 # Creating a meshgrid  
 X, Y = np.meshgrid(X, Y)  
 R = np.sqrt(np.abs(X**2 - Y**2))  
 Z = np.exp(R)  
 # plot the surface   
 surf = ax.plot_surface(X, Y, Z, cmap=cm.GnBu, antialiased=False)  
 # Customize the z axis.  
 ax.zaxis.set_major_locator(LinearLocator(10))  
 ax.zaxis.set_major_formatter(FormatStrFormatter('%.02f'))  
 ax.set_title('Surface Plot')  
 # Add a color bar which maps values to colors.  
 fig.colorbar(surf, shrink=0.5, aspect=5)  
 plt.show()  

One of the main applications of surface plots in machine learning or data science is the analysis of the loss function. From a surface plot, we can analyze how the hyperparameters affect the loss function and thus help prevent overfitting of the model.

python, 3d plot, machine learning, data visualization, machine learning, loss function, gradient descent, big data
Fig 4. Surface plot visualizing the dependent variable w.r.t the independent variables in 3-dimensions

Visualizing high-dimensional datasets

Dimensionality refers to the number of attributes present in the dataset. For example, consumer-retail datasets can have a vast amount of variables (e.g. sales, promos, products, open, etc.). As a result, visually exploring the dataset to find potential correlations between variables becomes extremely challenging.

Therefore, we use a technique called dimensionality reduction to visualize higher dimensional datasets. Here, we will focus on two such techniques :

  • Principal Component Analysis (PCA)
  • T-distributed Stochastic Neighbor Embedding (t-SNE)

Principal Component Analysis (PCA)

Before we jump into understanding PCA, let’s review some terms:

  • Variance: Variance is simply the measure of the spread or extent of the data. Mathematically, it is the average squared deviation from the mean position.varaince, PCA, prinicipal component analysis
  • Covariance: Covariance is the measure of the extent to which corresponding elements from two sets of ordered data move in the same direction. It is the measure of how two random variables vary together. It is similar to variance, but where variance tells you the extent of one variable, covariance tells you the extent to which the two variables vary together. Mathematically, it is defined as:

A positive covariance means X and Y are positively related, i.e., if X increases, Y increases, while negative covariance means the opposite relation. However, zero variance means X and Y are not related.

PCA, Principal Component Analysis , dimension reduction, python, machine learning, big data, image classification
Fig 5. Different types of covariance

PCA is the orthogonal projection of data onto a lower-dimension linear space that maximizes variance (green line) of the projected data and minimizes the mean squared distance between the data point and the projects (blue line). The variance describes the direction of maximum information while the mean squared distance describes the information lost during projection of the data onto the lower dimension.

Thus, given a set of data points in a d-dimensional space, PCA projects these points onto a lower dimensional space while preserving as much information as possible.

 principal component analysis, machine learning, dimension reduction technqieus, data visualization techniques, deep learning, ICA, PCA
Fig 6. Illustration of principal component analysis

In the figure, the component along the direction of maximum variance is defined as the first principal axis. Similarly, the component along the direction of second maximum variance is defined as the second principal component, and so on. These principal components are referred to the new dimensions carrying the maximum information.

 # We will use the breast cancer dataset as an example  
 # The dataset is a binary classification dataset  
 # Importing the dataset  
 from sklearn.datasets import load_breast_cancer  
 data = load_breast_cancer()  
 X = pd.DataFrame(data=data.data, columns=data.feature_names) # Features   
 y = data.target # Target variable   
 # Importing PCA function  
 from sklearn.decomposition import PCA  
 pca = PCA(n_components=2) # n_components = number of principal components to generate  
 # Generating pca components from the data  
 pca_result = pca.fit_transform(X)  
 print("Explained variance ratio : \n",pca.explained_variance_ratio_)  
 Out: Explained variance ratio :   
  [0.98204467 0.01617649]  

We can see that 98% (approx) variance of the data is along the first principal component, while the second component only expresses 1.6% (approx) of the data.

 # Creating a figure   
 fig = plt.figure(1, figsize=(10, 10))  
 # Enabling 3-dimensional projection   
 ax = fig.gca(projection='3d')  
 for i, name in enumerate(data.target_names):  
   ax.text3D(np.std(pca_result[:, 0][y==i])-i*500 ,np.std(pca_result[:, 1][y==i]),0,s=name, horizontalalignment='center', bbox=dict(alpha=.5, edgecolor='w', facecolor='w'))  
 # Plotting the PCA components    
 ax.scatter(pca_result[:,0], pca_result[:, 1], c=y, cmap = plt.cm.Spectral,s=20, label=data.target_names)  
 plt.show()  
PCA, principal component analysis, pca, ica, higher dimension data, dimension reduction techniques, data visualization of higher dimensions
Fig 7. Visualizing the distribution of cancer across the data

Thus, with the help of PCA, we can get a visual perception of how the labels are distributed across given data (see Figure).

T-distributed Stochastic Neighbour Embedding (t-SNE)

T-distributed Stochastic Neighbour Embeddings (t-SNE) is a non-linear dimensionality reduction technique that is well suited for visualization of high-dimensional data. It was developed by Laurens van der Maten and Geoffrey Hinton. In contrast to PCA, which is a mathematical technique, t-SNE adopts a probabilistic approach.

PCA can be used for capturing the global structure of the high-dimensional data but fails to describe the local structure within the data. Whereas, “t-SNE” is capable of capturing the local structure of the high-dimensional data very well while also revealing global structure such as the presence of clusters at several scales. t-SNE converts the similarity between data points to joint probabilities and tries to maximize the Kullback-Leibler divergence between the joint probabilities of the low-dimensional embeddings and high-dimension data. In doing so, it preserves the original structure of the data.

 # We will be using the scikit learn library to implement t-SNE  
 # Importing the t-SNE library   
 from sklearn.manifold import TSNE  
 # We will be using the iris dataset for this example  
 from sklearn.datasets import load_iris  
 # Loading the iris dataset   
 data = load_iris()  
 # Extracting the features   
 X = data.data  
 # Extracting the labels   
 y = data.target  
 # There are four features in the iris dataset with three different labels.  
 print('Features in iris data:\n', data.feature_names)  
 print('Labels in iris data:\n', data.target_names)  
 Out: Features in iris data:  
  ['sepal length (cm)', 'sepal width (cm)', 'petal length (cm)', 'petal width (cm)']  
 Labels in iris data:  
  ['setosa' 'versicolor' 'virginica']  
 # Loading the TSNE model   
 # n_components = number of resultant components   
 # n_iter = Maximum number of iterations for the optimization.  
 tsne_model = TSNE(n_components=3, n_iter=2500, random_state=47)  
 # Generating new components   
 new_values = tsne_model.fit_transform(X)  
 labels = data.target_names  
 # Plotting the new dimensions/ components  
 fig = plt.figure(figsize=(5, 5))  
 ax = Axes3D(fig, rect=[0, 0, .95, 1], elev=48, azim=134)  
 for label, name in enumerate(labels):  
   ax.text3D(new_values[y==label, 0].mean(),  
        new_values[y==label, 1].mean() + 1.5,  
        new_values[y==label, 2].mean(), name,  
        horizontalalignment='center',  
        bbox=dict(alpha=.5, edgecolor='w', facecolor='w'))  
 ax.scatter(new_values[:,0], new_values[:,1], new_values[:,2], c=y)  
 ax.set_title('High-Dimension data visualization using t-SNE', loc='right')  
 plt.show()  
Iris data set, Tsne, data visualization of words, data visualization techniques, dimension reduction techniques, higher dimension data
Fig 8. Visualizing the feature space of the iris dataset using t-SNE

Thus, by reducing the dimensions using t-SNE, we can visualize the distribution of the labels over the feature space. We can see that in the figure the labels are clustered in their own little group. So, if we’re to use a clustering algorithm to generate clusters using the new features/components, we can accurately assign new points to a label.

Conclusion

Let’s quickly summarize the topics we covered. We started with the generation of heatmaps using random numbers and extended its application to a real-world example. Next, we implemented choropleth graphs to visualize the data points with respect to geographical locations. We moved on to implement surface plots to get an idea of how we can visualize the data in a three-dimensional surface. Finally, we used two- dimensional reduction techniques, PCA and t-SNE, to visualize high-dimensional datasets.

I encourage you to implement the examples described in this article to get a hands-on experience. Hope you enjoyed the article. Do let me know if you have any feedback, suggestions, or thoughts on this article in the comments below!

Subscribe to The HackerEarth Blog

Get expert tips, hacks, and how-tos from the world of tech recruiting to stay on top of your hiring!

Author
Shubham Gupta
Calendar Icon
July 9, 2018
Timer Icon
3 min read
Share

Hire top tech talent with our recruitment platform

Access Free Demo
Related reads

Discover more articles

Gain insights to optimize your developer recruitment process.

What AI Is Forcing HR to Rethink About Hiring

What AI is forcing HR to rethink

For recruiters and talent leaders, AI has made one thing clear: resumes can no longer be trusted as the primary signal of candidate capability. What AI is forcing HR to rethink is the entire screening stack — from how reqs are written, to how the ATS filters applicants, to how quality of hire (QoH) is measured against time-to-fill. According to LinkedIn's Future of Recruiting 2024 report, 73% of recruiters say skills-based hiring is a priority, yet most pipelines still screen on degree and employer brand at the ATS layer. That gap is where the rethink begins.

Why traditional resumes no longer predict strong hires

Resumes measure presentation more reliably than capability. Recruiters have long used job titles, company names, degrees, and years of experience as proxies for performance, but generative AI tools — ChatGPT, Teal, Rezi, and Kickresume among them — have collapsed the cost of producing a polished application. The World Economic Forum's Future of Jobs Report 2023 found that 44% of workers' core skills are expected to change by 2027, which means a resume snapshot ages faster than the role it describes.

For recruiters, the operational impact is direct: pipelines fill, screen rates rise, and yet QoH stays flat. As AI becomes more deeply embedded in hiring, HR leaders are being forced to rethink a single question:

What if resumes are no longer the best predictor of performance?

That question is reshaping recruitment faster than many organizations expected — though, as discussed later, the shift away from resumes carries its own trade-offs.

Share of Workers' Core Skills Expected to Change by 2027
Source: World Economic Forum Future of Jobs Report 2023

The resume was built for a different era

Modern work no longer fits the resume's static format. Skills evolve in months rather than years, roles overlap across functions, and professionals build expertise through online communities, freelance projects, bootcamps, and self-directed learning. According to SHRM's 2024 Talent Trends research, nearly half of HR leaders report that candidates from non-traditional backgrounds are increasingly competitive on assessments.

Resumes still reduce people to standardized timelines, and many capable candidates are filtered out by ATS rules simply because they lack the "right" employer logos. At the same time, candidates skilled in resume optimization can outperform genuinely capable professionals at the screen stage — a pattern that pre-dates AI but has been amplified by it.

It has become far easier for candidates to generate polished resumes, cover letters, and interview responses in minutes. For recruiters, the takeaway is practical: formatting and phrasing are no longer reliable proxies for capability.

AI did not break hiring — it exposed existing problems

AI did not create the resume problem; it surfaced one already present in most hiring funnels. Surveys of recruiters, including Gartner's 2024 HR research, have consistently shown three pre-AI pressures: recruiters overwhelmed by application volume, candidates optimizing resumes to pass ATS filters, and hiring managers reporting weak outcomes despite reviewing seemingly strong resumes.

AI accelerated these problems to a point where they can no longer be ignored. Many candidates can now generate a highly optimized application in seconds, and recruiters increasingly struggle to distinguish between candidates skilled at self-presentation and those who can actually do the work.

The operational shift is moving from:

"What does your resume say?"

Toward:

"Can you actually do the job?"

The rise of skills-based hiring

Skills-based hiring outperforms resume screening because it measures demonstrated capability rather than credential proximity. A growing number of organizations — including IBM, Accenture, and Delta, profiled in LinkedIn's Skills Path program — are moving toward skills-first models that prioritize practical assessments, simulations, project work, and role-specific problem-solving over employer brand or degree.

This trend is most visible in technology hiring, where coding assessments and real-world technical evaluations generally provide stronger signals than resumes alone, particularly when compared against resume-only screens for time-to-productivity. HackerEarth has run over 100 million developer assessments across enterprise hiring programs, and the consistent pattern in that dataset is that demonstrated coding performance correlates more closely with on-the-job output than degree or prior employer.

Beyond tech, a growing number of organizations are extending the model: marketing teams using campaign-brief exercises, sales teams using recorded customer-handling scenarios, and operations teams using situational judgment tests. For a deeper view of how this maps to specific roles, see our skills-based hiring guide and developer assessment platform.

Where skills-based hiring breaks down

Skills-based hiring is not without trade-offs, and recruiters evaluating it should plan for known failure modes:

  • Assessment bias. Poorly designed assessments can disadvantage career returners, caregivers, and candidates with limited test-taking time as severely as resume screens disadvantage non-traditional backgrounds.
  • Gaming of take-home tests. Unproctored coding or case exercises are increasingly solvable with generative AI, which means assessment design has to evolve in step with candidate tooling.
  • Candidate experience at scale. Long assessment batteries lower completion rates and damage employer brand, particularly for senior candidates who have multiple offers in play.
  • Legal exposure. In jurisdictions including New York City (Local Law 144) and under the EU AI Act, automated employment decision tools are subject to bias audits and disclosure requirements. Recruiters should confirm vendor compliance before deploying AI-driven scoring.

The honest read: most organizations announcing a "shift" to skills-based hiring still filter by degree at the ATS layer. The shift is real, but it is uneven.

Skills-Based Hiring Priority vs. ATS Screening Reality
Source: LinkedIn Future of Recruiting 2024; ATS screening figure illustrative based on article claims

Why HR leaders are rethinking potential

Potential is becoming more measurable in ways resumes never allowed. Traditional hiring often prioritized pedigree — familiar universities, recognizable employers, conventional career paths — but AI-powered assessment platforms (HackerEarth, HireVue, Pymetrics, Codility, and Workday Skills Cloud among them) score candidates on demonstrated performance against role-specific tasks, calibrated to a benchmark population.

These tools typically combine task-based evaluations, behavioral simulations, and structured scoring rubrics. Their limits matter too: they score what they are trained to score, they can encode bias from the training population, and they do not measure long-arc traits like cultural contribution or leadership trajectory. Recruiters should treat them as one signal in a structured interview loop, not a single decision point.

Research suggests that candidates without elite degrees frequently match or outperform credentialed peers on standardized technical assessments. In many cases, career switchers and self-taught professionals demonstrate strong adaptability and practical skill. Organizations that shift toward capability-based evaluation may gain access to broader and more diverse talent pools — though, as noted above, only if assessment design itself is audited for fairness.

The recruiter's role is changing

AI is not replacing recruiters; it is shifting where recruiters spend their time. Traditional recruitment rewarded screening volume and speed. Modern hiring increasingly rewards judgment, stakeholder alignment, and structured decision-making.

As automation handles sourcing, scheduling, resume parsing, and initial outreach, recruiters are spending more time on work AI cannot do well:

  • Probing candidate motivation through structured behavioral interviews
  • Evaluating adaptability against specific role demands using scorecards
  • Building hiring-manager alignment on the req and intake brief
  • Designing candidate-experience touchpoints that protect offer-accept rates
  • Calibrating assessment results against on-the-job performance data

The recruiter who succeeds in an AI-heavy pipeline is the one who can interpret signal, not the one who can scan resumes faster.

Candidates are changing faster than hiring systems

Modern career paths now move faster than most ATS configurations. Today's workforce values flexibility, creativity, continuous learning, and project-based growth, and many professionals build experience through freelance work, startups, creator platforms, and side projects. Their resumes often look unconventional, but unconventional no longer equates to unqualified.

Organizations that shift toward capability-based evaluation may access talent pools that rigid resume filters would otherwise miss. For practical guidance on adjusting screening criteria, see our guide to evaluating an ATS for skills-based hiring.

The future of hiring will feel more human

There is an irony in the AI shift: as resumes become easier to automate, organizations are being pushed to evaluate creativity, adaptability, collaboration, and real-world problem-solving more directly. The likely structure of mature AI-enabled hiring is AI handling repetitive tasks — sourcing, scheduling, parsing, initial scoring — while recruiters and hiring managers focus on nuance, context, and long-term fit.

FAQ

Is skills-based hiring more effective than resume screening? Skills-based hiring tends to predict on-the-job performance more reliably than resume screening for roles where the work can be assessed directly, such as engineering, data, sales, and marketing execution. According to LinkedIn's Future of Recruiting report, 73% of recruiters now prioritize skills-based approaches. Effectiveness depends heavily on assessment design and on whether downstream ATS filters still gate candidates by degree.

What HR processes is AI changing first? AI is changing sourcing, resume parsing, candidate matching, and initial assessment scoring first, because these are high-volume, rules-based tasks. Structured interviewing, offer negotiation, and onboarding remain primarily human-led, though AI-assisted note-taking and scorecard analysis are growing.

Will AI replace recruiters? AI is unlikely to replace recruiters, but it is changing the skill profile. Recruiters who can interpret assessment data, align hiring managers, and design candidate experience will be more valuable; recruiters whose role is primarily resume scanning are most exposed.

How do I evaluate an AI hiring tool for bias? Ask the vendor for a bias audit report (required under NYC Local Law 144 for automated employment decision tools), the demographic composition of the training data, the validation methodology against job performance, and the appeal process for candidates. Avoid tools that cannot answer all four.

Is resume-based hiring going away? Resume-based hiring is under pressure but not disappearing. Most organizations are moving toward hybrid models where resumes provide context and assessments provide the capability signal. A full move away from resumes is unlikely in the next hiring cycle for most enterprises.

What is the biggest risk of switching to skills-based hiring? The biggest risk is poorly designed assessments that introduce new forms of bias or damage candidate experience. A skills-based process built on a long, unproctored, untested assessment battery will perform worse than a structured resume screen.

Next steps: See it in action

If you are a recruiter or talent leader evaluating how to move from resume-led to skills-led screening, book a demo of HackerEarth Assessments to see how role-specific evaluations, proctoring, and benchmarked scoring fit into an existing ATS pipeline. For background reading, see our developer assessment platform overview and the HackerEarth recruiter blog.

Recruiters who pair structured assessment data with strong human judgment build better pipelines than either resumes or AI alone can produce.

Must-Know Recruitment Questions for HR and Talent Acquisition Teams (2026)

Recruitment questions every HR professional should know in 2025

Estimated read time: 7 minutes

Most "tell me about yourself" answers are now written by ChatGPT the night before the interview. That single shift — candidates arriving with rehearsed, AI-polished narratives — has broken the standard interview script and forced recruiters to redesign their question sets from the ground up. This guide outlines the categories of recruitment questions every HR professional should know in 2025, why each matters, and example questions you can adapt to your hiring rubric or scorecard today.

LinkedIn's 2024 Global Talent Trends report notes that skills-based hiring and behavioral assessment have moved from optional to expected in most talent acquisition workflows. Yet many hiring conversations still rely on outdated prompts that produce polished answers and unclear signals. The recruiter persona — the one running req intake, pipeline reviews, and screen calls — needs a tighter toolkit.

Who this is for: This article is written for recruiters and talent acquisition partners running structured interviews. Hiring managers building a scorecard alongside the recruiter will also find the question categories useful.

Adoption of Structured Hiring Practices Among HR Teams (2020–2025)
Source: LinkedIn Global Talent Trends claims cited in article

Why modern recruitment questions fail when they stay outdated

Industry observers at SHRM have noted that candidates are better prepared, interviews are more structured, and expectations on both sides have risen (SHRM research). With generative AI tools widely available, many candidates now enter screens with refined, rehearsed narratives.

The result is predictable — polished answers, unclear signals, and decisions made on incomplete understanding. The quality of the recruitment questions you bring into the room directly defines the quality of the signal you capture on the scorecard.

A contestable position worth stating plainly: behavioral interview frameworks like STAR are now overused to the point where candidates have memorized the structure, which reduces signal quality unless interviewers probe past the rehearsed answer with follow-ups.

What this article won't claim

Structured behavioral interviewing is not a silver bullet. Over-indexing on adaptability can screen out deep specialists whose value is stability and depth. Ownership-mindset framing, if applied rigidly, can disadvantage neurodivergent candidates or those from cultures where collective credit is the norm. Use the questions below as part of a balanced rubric — not as a single filter.

From "tell me about yourself" to understanding real intent

Traditional opening questions rarely reveal a candidate's intent or direction. A stronger opening probes why a candidate is moving at this specific point and what kind of work keeps them engaged beyond compensation.

Evidence from Gallup's 2023 State of the Global Workplace report suggests today's workforce is increasingly motivated by alignment, learning, and perceived growth — not stability alone. If this layer is missed early in the interview, the rest of the evaluation becomes less reliable.

Example intent and motivation questions

  • "Walk me through the last time you decided to leave a role. What specifically triggered the decision?"
  • "What kind of work has made you lose track of time in the last 12 months?"
  • "If this role didn't exist, what would your second-choice next move be — and why?"
  • "What would need to be true 18 months from now for you to consider this move a success?"

What to listen for

  • Specific triggers and trade-offs, not generic phrases like "growth" or "new challenges."
  • Consistency between the stated motivation and the candidate's actual career pattern.

Red flags

  • Answers that match the job description back to you almost verbatim.
  • Vague language about "culture" or "growth" with no concrete example.

Behavioral and competency-based recruitment questions: getting past scripted answers

One of the biggest challenges recruiters face today is not lack of talent, but over-prepared talent. Hiring practitioners increasingly find that well-structured, confident answers do not always reflect real capability, especially when responses are influenced by preparation tools or rehearsed narratives.

This is why competency-based questions — which explore decision-making logic, trade-offs, and real-time reasoning — produce higher signal than story-based prompts alone. For technical roles, pairing these with a practical assessment helps confirm what the interview surfaces. HackerEarth's skill assessments use role-specific question libraries and rubric-based scoring so the recruiter can compare candidate outputs against a defined standard, rather than relying on the candidate's own narrative of their capability.

Example behavioral and competency-based questions

  1. "Tell me about a decision you made in the last six months that you would make differently today. What changed your thinking?"
  2. "Describe a time you disagreed with your manager on a priority. How did you handle it?"
  3. "Walk me through a project where the scope changed mid-execution. What did you cut, and why?"
  4. "Give me an example of feedback you initially rejected but later acted on."

How to probe past the rehearsed answer

If a candidate delivers a clean STAR-format response, follow up with: "What's one detail you usually leave out of that story?" or "Who would tell that story differently?" These prompts disrupt the rehearsed structure and surface the actual reasoning.

Situational judgment and adaptability questions

Workplaces are shaped by continuous change — shifting priorities, evolving tools, and hybrid collaboration. Many hiring teams now treat adaptability as a core hiring parameter rather than a soft skill, particularly for roles where ambiguity is the default state.

Situational judgment questions present a realistic scenario and ask the candidate how they would navigate it. They are harder to rehearse than story-based prompts because the scenario is novel.

Example situational judgment questions

  • "You join the team and discover the project you were hired to lead has already slipped two months. What are your first three actions in week one?"
  • "Two stakeholders give you conflicting priorities on the same Friday. Both are senior to you. How do you handle it?"
  • "A teammate is consistently delivering work that is technically correct but late. You are not their manager. What do you do?"
  • "You realize halfway through a quarter that the metric you committed to is no longer the right one. How do you raise it?"
  • "Your top-performing team member tells you in a 1:1 they're considering leaving. They haven't told their manager. What do you do in the next 24 hours?"
  • "A vendor misses a critical deadline that puts your launch at risk. Walk me through how you decide whether to escalate, switch vendors, or absorb the delay."

What to listen for

  • Sequencing — do they ask clarifying questions before acting?
  • Trade-off awareness — do they acknowledge what they would not do?
  • Stakeholder reasoning — who do they involve, and when?

Culture and values-alignment questions

Cultural fit is often misunderstood as shared interests or personality alignment. A more useful frame is behavioral consistency with the team's working norms.

A second contestable position: generic "culture fit" questions should be retired in favor of values-alignment scenarios that name a specific behavior the company expects. "Culture fit" as a phrase invites bias; a scenario tied to a stated company value forces a more concrete answer.

Example values-alignment questions

  • "Our team gives feedback in writing before live discussion. Describe the last time you gave hard feedback. What did you write down first?"
  • "We prioritize shipping over perfection. Tell me about a time you shipped something you weren't fully proud of. What happened next?"
  • "Describe the last time you changed your mind because of data, not opinion."

For a deeper look at how culture signals show up in technical interviews, see our guide on how to design a structured technical interview.

Identifying ownership mindset over task execution

Task completion alone is no longer a strong hiring indicator for most knowledge roles. What recruiters and hiring managers increasingly screen for is the ownership mindset — how a candidate behaves when outcomes are unclear, accountability is shared, or success metrics evolve mid-execution.

A concrete scenario

Consider a Series B SaaS company hiring its first sales operations manager. The pipeline is messy, the CRM is half-implemented, and the founder is the de-facto rev-ops owner. Standard task-execution questions ("walk me through how you'd clean a pipeline") produce textbook answers. Ownership-mindset questions — "What would you stop doing in your first 30 days, and how would you tell the founder?" — surface whether the candidate can hold the seat. A strong answer names a specific thing they'd stop (e.g., "weekly pipeline reviews in their current form"), the trade-off they're willing to accept, and how they'd frame the conversation with the founder. A weak answer lists everything they'd add — new dashboards, new processes, new tooling — without naming a single thing they'd remove or a single conversation they'd own.

Example ownership questions

  • "Tell me about something you fixed that wasn't your job to fix."
  • "Describe a time the goalposts moved on you. What did you do in the first 48 hours?"
  • "What's a process you killed, and what replaced it?"

Red flags

  • Answers that always credit "the team" with no individual decision named.
  • Stories where the candidate is consistently the rescuer or always the victim.

Questions to avoid: legal and compliance boundaries

A structured question set is only as strong as its weakest prompt. In most jurisdictions, certain questions are either illegal or carry significant legal risk because they touch protected characteristics or regulated information.

Common categories to avoid in initial screens:

  • Age, date of birth, or graduation year as a proxy for age.
  • Marital status, family planning, or childcare arrangements ("Do you plan to have kids?" "Who watches your children?").
  • Citizenship or national origin beyond the legally permitted "Are you authorized to work in [country]?"
  • Religion, religious holidays, or observance schedules.
  • Disability or medical history, including questions about prior workers' compensation claims.
  • Salary history — now restricted or banned in many US states and several other jurisdictions. Ask about salary expectations instead.

For a deeper treatment of pre-employment screening practices and compliance, see our overview of pre-employment assessment design. Always confirm specifics with your legal or HR compliance partner — local law varies.

Rethinking what "good answers" actually mean

In traditional interviews, clarity and confidence were often equated with strong performance. Modern hiring increasingly challenges this assumption.

The signal you want is depth, consistency, and reasoning quality — even when responses are less polished. A candidate who says "I don't know, but here's how I'd find out" is often a stronger hire than one who delivers a fluent answer with no underlying logic.

To codify this on the scorecard, score reasoning and presentation as separate rubric lines. A candidate can score 4/5 on reasoning and 2/5 on presentation and still be a strong hire — but you will only see that if the rubric separates them.

FAQ: structured hiring questions

Which recruitment question category is most often skipped — and why does it matter?

In practice, ownership-mindset questions are the category recruiters most often skip, because they're the hardest to score consistently and the answers don't fit neatly into STAR. The cost of skipping them is high: ownership signal is what separates strong individual contributors from people who execute well only when the path is clear. If you only have time to add one new category to your interview guide, this is the one with the largest marginal lift.

What is the STAR method, and is it still useful?

STAR stands for Situation, Task, Action, Result. It is a candidate-response framework that helps structure answers to behavioral questions. It remains useful as a default structure, but because most candidates now prepare STAR-formatted stories, interviewers should probe past the rehearsed answer with follow-up questions about trade-offs, omitted details, and alternative perspectives.

How many interview question frameworks should a structured interview include?

Practitioners commonly recommend 5–8 core questions per 45-minute round, with planned follow-up probes. This is a rule of thumb rather than a sourced standard. Fewer questions with deeper probes typically produce more signal than many surface-level questions.

What is the difference between behavioral and situational judgment questions?

Behavioral questions ask about past actions ("Tell me about a time you…"). Situational judgment questions ask about hypothetical scenarios ("What would you do if…"). Behavioral questions test verified history; situational questions test reasoning on novel problems. Strong interview loops use both.

How do you reduce bias in recruitment questions?

Use a structured interview where every candidate is asked the same core questions, score answers on a defined rubric, and have at least two interviewers calibrate independently before discussing. Avoid "culture fit" as a freeform judgment; replace it with values-alignment scenarios tied to documented company behaviors.

Can skill assessments replace interview questions?

No. Assessments and interview questions answer different things. Assessments produce structured skill evaluation against a defined rubric; interview questions surface reasoning, motivation, and judgment. The strongest hiring loops pair both — skill assessments for verified capability, structured behavioral interviews for everything assessments can't measure.

Final thoughts and next steps

The recruitment questions every HR professional should know in 2025 are not a fixed list — they are a working toolkit you adapt to the role, the level, and the rubric. The categories above (intent, behavioral, situational, values-alignment, ownership) give you a structure; the example questions give you a starting point.

Next steps

  • Audit your current interview guide. Map every question to one of the five categories above. If a category is empty, add two questions.
  • Separate reasoning from presentation on your scorecard. Score them as distinct rubric lines.
  • Pair interviews with skill verification. Schedule a demo of HackerEarth Assessments to see how rubric-based skill scores integrate with your interview scorecard, so your hiring decision isn't relying on candidate self-report alone.

Sources referenced: LinkedIn Global Talent Trends, SHRM Research, Gallup State of the Global Workplace.

Why Empathy Could Be Your Biggest Hiring Advantage

Why Empathy Could Be Your Biggest Hiring Advantage

Why Human-Centered Hiring Matters More Than Ever

Hiring has never been more optimized than it is today.

From AI-powered recruitment tools to automated screening systems and structured interview workflows, HR and talent acquisition teams now have more ways than ever to improve hiring speed, consistency, and scalability.

But in the middle of this efficiency-driven approach, one critical element is slowly disappearing: employee empathy.

Empathy in hiring is not about slowing down recruitment or making decisions less objective. It is about ensuring candidates are treated like people navigating important career decisions, not just profiles moving through a hiring pipeline.

As recruitment becomes increasingly system-driven, preserving the human side of hiring is becoming both more difficult and more important.

For HR leaders and talent acquisition professionals, this is no longer just a workplace culture discussion. It directly impacts candidate experience, employer branding, hiring quality, and long-term employee retention.

When Hiring Feels Like a Process Instead of an Experience

Most modern recruitment systems are designed around efficiency.

Applications are filtered automatically, interviews are scheduled faster, and candidates move through hiring stages with minimal manual effort. Operationally, this creates speed and structure.

But from a candidate’s perspective, the experience can often feel distant and impersonal.

Many candidates go through multiple interview rounds without clear communication, feedback, or transparency about timelines and expectations. Even when the hiring process is fair, it may still feel mechanical.

This creates a growing challenge for HR and TA teams:

How do you maintain hiring efficiency without removing the human connection from recruitment?

That is where empathy becomes essential.

The Hidden Cost of Low-Empathy Hiring

The impact of low-empathy hiring is not always immediate, but it compounds over time.

Candidates remember how organizations made them feel during the recruitment process, especially during rejection or delayed communication. Those experiences shape employer perception long before someone becomes an employee.

Over time, this directly affects employer brand and candidate trust.

There is also another hidden cost.

When hiring becomes too rigid or overly process-driven, recruiters may overlook candidates with strong long-term potential simply because they do not perfectly match predefined criteria.

Without empathy, context disappears.

And when context disappears, opportunities are often missed.

For HR leaders, empathy is no longer just a soft skill. It is becoming a competitive hiring advantage.

Why Empathy Is Becoming a Competitive Hiring Skill

Today’s workforce is far more dynamic than it was a decade ago.

Professionals switch industries, build careers through unconventional paths, and learn skills outside traditional education systems. As a result, resumes and structured evaluations only tell part of the story.

Empathy helps recruiters understand what exists beyond the surface.

It allows hiring teams to better understand:

  • Career transitions
  • Employment gaps
  • Nontraditional experience
  • Personal growth journeys

This shift changes the entire hiring mindset.

Instead of asking:

“Does this candidate perfectly match the role?”

Recruiters are increasingly asking:

“What could this candidate become in the right environment?”

That perspective creates stronger and more future-focused hiring decisions.

Where Empathy Fits in Modern Recruitment

Empathy does not replace structured hiring systems.

In fact, it becomes most effective when built into them.

Simple improvements in communication can significantly improve candidate experience. Clear updates, transparent timelines, respectful rejection emails, and honest feedback all contribute to a more human-centered recruitment process.

These small changes often have a lasting impact on how candidates perceive an organization.

For HR teams, the goal is not to remove structure from hiring.

The goal is to ensure structure does not remove humanity.

Better Hiring Decisions Start With Better Human Understanding

Empathy also improves the quality of hiring decisions themselves.

When recruiters take time to understand a candidate’s context, they often uncover strengths that are not immediately visible on resumes or scorecards.

A candidate who appears average on paper may demonstrate exceptional adaptability, resilience, or problem-solving ability in real-world situations.

Without empathy, those signals are easy to miss.

For talent acquisition leaders, this means recognizing that hiring is not just about selecting the strongest profile.

It is about identifying the strongest long-term fit within a real human context.

Final Thoughts

As recruitment continues evolving through automation, AI hiring tools, and structured decision-making, the biggest risk is not losing efficiency.

It is losing humanity.

Employee empathy ensures hiring remains people-focused, even as processes become more technology-driven.

It does not slow recruitment down. Instead, it helps organizations create better candidate experiences, stronger employer brands, and more thoughtful hiring decisions.

Because candidates may forget interview questions or assessment scores.

But they will always remember how they were treated during the hiring process.

And in today’s competitive talent market, that experience often determines whether top talent chooses to join or walk away.

Top Products

Explore HackerEarth’s top products for Hiring & Innovation

Discover powerful tools designed to streamline hiring, assess talent efficiently, and run seamless hackathons. Explore HackerEarth’s top products that help businesses innovate and grow.
Frame
Hackathons
Engage global developers through innovation
Arrow
Frame 2
Assessments
AI-driven advanced coding assessments
Arrow
Frame 3
FaceCode
Real-time code editor for effective coding interviews
Arrow
Frame 4
L & D
Tailored learning paths for continuous assessments
Arrow
Get A Free Demo