Search
Close this search box.

We are creating some awesome events for you. Kindly bear with us.

Ohio State University Empowering Accessibility for All

Getting your Trinity Audio player ready...

Researchers at The Ohio State University have pioneered the initiative to enhance internet accessibility for individuals with disabilities. In a world where the internet has become an intricately woven fabric of society, its complexity poses challenges, especially for those with disabilities.

Image credits: news.osu.edu

The team, led by Yu Su, an assistant professor of computer science and engineering, is developing an artificial intelligence (AI) agent capable of executing complex tasks on any website using simple language commands, simplifying digital interactions.

As the internet has evolved over the last three decades, its intricacies have grown exponentially. Yu Su highlighted the need to address this complexity, particularly for individuals with disabilities, stating, “For some people, especially those with disabilities, it’s not easy for them to browse the internet. We rely increasingly on the computing world in our daily life and work, but there are increasingly a lot of barriers to that access, which, to some degree, widens the disparity.”

The team presented their innovative work at a conference for AI and machine learning research. Their approach leveraged large language models to create web agents—online AI helpers—that mimic human behaviour when browsing the web. The AI agent demonstrated an ability to understand the layout and functionality of different websites using language processing, a testament to the power of large language models.

A pivotal aspect of their research is the creation of Mind2Web, the first dataset specifically designed for generalist web agents. Unlike previous efforts focused on simulated websites, Mind2Web embraces the dynamic and complex nature of real-world websites. It underscored the agent’s capacity to generalise, even when faced with entirely new websites. The team collected over 2,000 open-ended tasks from 137 different real-world websites, providing diverse challenges for training the AI agent.

Tasks included in the dataset range from booking international flights and following celebrity accounts to browsing specific genres of films on streaming platforms. The versatility showcased by the AI agent opens up new possibilities for future models to navigate and learn autonomously across various websites.

The success of this research is partly attributed to the recent development of large language models. The large language model has been widely used to generate content automatically, spanning poetry, jokes, cooking advice, and even medical diagnoses. However, the challenge lies in processing a single website’s vast information, as one can contain thousands of raw HTML elements.

To address this challenge, the researchers introduced a framework called MindAct. This framework utilises a two-pronged agent combining small and large language models to carry out complex tasks. The results show that MindAct outperforms other common modelling strategies and effectively understands various concepts.

While the potential of this AI agent to simplify internet interactions and enhance accessibility is evident, the study also highlights ethical concerns. The ability of the model to translate online instructions into real-world actions raises the possibility of misuse, from manipulating financial information to spreading misinformation. Yu Su emphasises the need for caution, stating, “We should be extremely cautious about these factors and make a concerted effort to try to mitigate them.”

As AI research progresses, Su anticipated growth in generalist web agents’ commercial use and performance. Despite the potential risks, he sees the real value of these tools in saving time and making seemingly impossible tasks possible.

The research received support from the National Science Foundation, the U.S. Army Research Lab, and the Ohio Supercomputer Centre. The collaborative effort involved co-authors Xiang Deng, Yu Gu, Boyuan Zheng, Shijie Chen, Samuel Stevens, Boshi Wang, and Huan Sun, all from Ohio State. As the digital landscape evolves, the delicate balance between innovation and responsible use of advanced AI technologies will play a crucial role in shaping the future of digital accessibility.

PARTNER

Qlik’s vision is a data-literate world, where everyone can use data and analytics to improve decision-making and solve their most challenging problems. A private company, Qlik offers real-time data integration and analytics solutions, powered by Qlik Cloud, to close the gaps between data, insights and action. By transforming data into Active Intelligence, businesses can drive better decisions, improve revenue and profitability, and optimize customer relationships. Qlik serves more than 38,000 active customers in over 100 countries.

PARTNER

CTC Global Singapore, a premier end-to-end IT solutions provider, is a fully owned subsidiary of ITOCHU Techno-Solutions Corporation (CTC) and ITOCHU Corporation.

Since 1972, CTC has established itself as one of the country’s top IT solutions providers. With 50 years of experience, headed by an experienced management team and staffed by over 200 qualified IT professionals, we support organizations with integrated IT solutions expertise in Autonomous IT, Cyber Security, Digital Transformation, Enterprise Cloud Infrastructure, Workplace Modernization and Professional Services.

Well-known for our strengths in system integration and consultation, CTC Global proves to be the preferred IT outsourcing destination for organizations all over Singapore today.

PARTNER

Planview has one mission: to build the future of connected work. Our solutions enable organizations to connect the business from ideas to impact, empowering companies to accelerate the achievement of what matters most. Planview’s full spectrum of Portfolio Management and Work Management solutions creates an organizational focus on the strategic outcomes that matter and empowers teams to deliver their best work, no matter how they work. The comprehensive Planview platform and enterprise success model enables customers to deliver innovative, competitive products, services, and customer experiences. Headquartered in Austin, Texas, with locations around the world, Planview has more than 1,300 employees supporting 4,500 customers and 2.6 million users worldwide. For more information, visit www.planview.com.

SUPPORTING ORGANISATION

SIRIM is a premier industrial research and technology organisation in Malaysia, wholly-owned by the Minister​ of Finance Incorporated. With over forty years of experience and expertise, SIRIM is mandated as the machinery for research and technology development, and the national champion of quality. SIRIM has always played a major role in the development of the country’s private sector. By tapping into our expertise and knowledge base, we focus on developing new technologies and improvements in the manufacturing, technology and services sectors. We nurture Small Medium Enterprises (SME) growth with solutions for technology penetration and upgrading, making it an ideal technology partner for SMEs.

PARTNER

HashiCorp provides infrastructure automation software for multi-cloud environments, enabling enterprises to unlock a common cloud operating model to provision, secure, connect, and run any application on any infrastructure. HashiCorp tools allow organizations to deliver applications faster by helping enterprises transition from manual processes and ITIL practices to self-service automation and DevOps practices. 

PARTNER

IBM is a leading global hybrid cloud and AI, and business services provider. We help clients in more than 175 countries capitalize on insights from their data, streamline business processes, reduce costs and gain the competitive edge in their industries. Nearly 3,000 government and corporate entities in critical infrastructure areas such as financial services, telecommunications and healthcare rely on IBM’s hybrid cloud platform and Red Hat OpenShift to affect their digital transformations quickly, efficiently and securely. IBM’s breakthrough innovations in AI, quantum computing, industry-specific cloud solutions and business services deliver open and flexible options to our clients. All of this is backed by IBM’s legendary commitment to trust, transparency, responsibility, inclusivity and service.